FUZZY GENES AND THEIR APPLICATION IN IMPROVED MOLECULAR ADJUVANTS

This invention relates to novel genetic constructs designed to permit expression or 
synthesis of multidomain proteins containing extended repetitive sequences, particularly 
those useful for the creation of molecular adjuvants and immunogens. 

The complement system consists of a set of serum proteins that are important in 
the response of the immune system to foreign antigens. The complement system 
becomes activated when its primary components are cleaved and the products, alone or 
with other proteins, activate additional complement proteins resulting in a proteolytic 
cascade. Activation of the complement system leads to a variety of responses including 
increased vascular permeability, chemotaxis of phagocytic cells, activation of 
inflammatory cells, opsonisation of foreign particles, direct killing of cells and tissue 
damage. Activation of the complement system may be triggered by antigen-antibody 
complexes (the classical pathway) or a normal slow activation may be amplified in the 
presence of cell walls of invading organisms such as bacteria and viruses (the alternative 
pathway). The complement system interacts with the cellular immune system through a 
specific pathway involving C3, a protein central to both classical and alternative 
pathways. The proteolytic activation of C3 gives rise to a large fragment (C3b) and 
exposes a chemically reactive internal thiolester linkage which can react covalently with 
external nucleophiles such as the cell surface proteins of invading organisms or foreign 
cells. As a result, the potential antigen is 'tagged' with C3b and remains attached to that
protein as it undergoes further proteolysis, first to iC3b and then to C3d,g and C3d. iC3b
is a ligand for complement receptor CR3 and the latter fragments are ligands for CR2.
Thus the labelling of antigen by C3b can result in a targeting mechanism for cells of the 
immune system bearing these receptors. 

That such targeting is important for augmentation of the immune response is first 
shown by experiments in which mice were depleted of circulating C3 and then 
challenged with an antigen (sheep erythrocytes). Removal of C3 reduced the antibody 
response to this antigen (M.B. Pepys, J. Exp. Med. 140, 126-145, 1974). The role of C3
was confirmed by studies in animals genetically deficient in either C3 or the upstream
components of the complement cascade which generate C3b, i.e. C2 and C4 (J.M.
Ahearn & D.T. Fearon, Adv. Immunol. 46, 183-219, 1989). More recently, it has been
shown that linear conjugation of a model antigen with more than two copies of the 
murine C3d fragment sequence resulted in a very large (1000-10000-fold) increase in 
antibody response in mice compared with unmodified antigen controls (P.W. Dempsey et
al., Science, 271: 348-350, 1996; WO96/17625, PCT/GB95/02851). The increase could
be produced without the use of conventional adjuvants such as Freund's complete 
adjuvant. The mechanism of this remarkable effect was demonstrated to be high-affinity 
binding of the multivalent C3d construct to CR2 on B-cells, followed by co-ligation of 
CR2 with another B-cell membrane protein, CD19, and with membrane-bound
immunoglobulin to generate a signal to the B-cell nucleus. 

In these experiments, the unmodified antigen control and linear fusions with one 
or two C3d domains were prepared by transfection of the appropriate coding plasmids 
into L cells followed by the selection of high-expressing clones. The most immunogenic 
construct, that with three C3d units, had to be expressed transiently in COS cells and this 
procedure gave a very poor yield of the fusion protein. In part, the low yield could be 
attributed to the generation of species containing the antigen but with lower molecular 
weights, corresponding to fewer than three C3d units. It was unclear from the published 
work of Dempsey et al whether the latter molecules originated by proteolysis of the 
three-C3d construct or whether they were due to a recombination event in vivo.

Using another expression system but the same C3d constructs as Dempsey et al.,
we have now obtained evidence that the generation of molecules with <3 C3d units from 
DNA encoding 3x C3d repeats is due to loss of one or more C3d units by homologous 
recombination and not due to post-translational processing (see below). This 
observation has also identified an efficient system for the expression of the C3d 
monomer. 

It is known generally that the production of high molecular weight polypeptides
containing multiple repeating sequences is difficult because of the tendency of repeated 
DNA sequences to undergo rearrangement during replication. Some of the limitations 
on internal repetitiveness in plasmids have been discussed by Gupta (Bio/Technology 1, 
602-609, 1983). Ferrari et al (US 5,641,648) have described methods for expression of
repetitive sequences using synthetic genes constructed from monomeric units which are 
concatenated by ligation. DNA sequences encoding the same repeated amino acid 
sequence but differing in nucleotide sequence either within or between monomers were 
constructed by exploiting the redundancy of the genetic code. The resulting lack of 
precise repetitiveness at the nucleotide level reduced homologous recombination to the 
point where the repeated oligopeptide sequence could be expressed. The work of Ferrari 
et al was restricted to relatively short repeating units of 4 to 30 codons (amino-acids) 
repeated a large number of times (typically ~ 30-fold). 
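
By way of illustration only, the degree of exact nucleotide repetition between two copies of a repeat unit can be estimated as the length of the longest stretch of DNA they share. The Python sketch below computes that length for two coding sequences. The function name and the example sequences are hypothetical, and the use of the longest shared run as an indicator of recombination risk is an assumption made for this illustration; no threshold length is stated in the text.

```python
def longest_shared_run(seq_a, seq_b):
    """Length of the longest contiguous stretch present in both sequences.

    Classic dynamic programme for the longest common substring,
    keeping only the previous row to save memory.
    """
    best = 0
    prev = [0] * (len(seq_b) + 1)
    for i in range(1, len(seq_a) + 1):
        cur = [0] * (len(seq_b) + 1)
        for j in range(1, len(seq_b) + 1):
            if seq_a[i - 1] == seq_b[j - 1]:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


# Hypothetical example: two encodings of the peptide Ala-Glu-Lys-Leu.
unit = "GCTGAAAAACTG"
variant = "GCCGAGAAGCTC"  # same peptide, different third bases

print(longest_shared_run(unit, unit))     # identical copies share the whole unit
print(longest_shared_run(unit, variant))  # synonymous copies share much less
```

Comparing each pair of units in a planned concatamer in this way gives a simple check that no long identical stretch has been left between them.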

The present invention describes a general method for introducing maximum 
variability into entire genes or fragments of genes, particularly those encoding 
autonomously folding protein domains or motifs of >30 amino acids, in such a way that 
different DNA units encoding identical or near-identical amino acid sequences can be
concatenated and expressed to give domain oligomers. 

The invention comprises the following elements: 

1. The construction of novel synthetic DNA sequences encoding an autonomously
folding protein domain and using in these DNA sequences the maximum third-base
redundancy in each codon permitted by the genetic code which is consistent
with a continuous reading frame and retention of the amino acid sequence. These
mixtures of DNA molecules are termed 'Fuzzy Genes'.

2. Assembling these genes into fuzzy or partially-fuzzy concatamers with or without 
in-frame coding regions for other proteins. 'Partially-fuzzy' means a construct in 
which some but not all regions of the DNA have third-base variability. 

3. Placing this library of sequences into a suitable expression vector and expressing
the population in a recombinant host cell.

4. The use of assays able to detect the presence of repeated-domain expressed
protein products. Host cell clones are screened for those capable of producing
useful levels of functionally active polypeptide concatamers / fusion proteins.

5. Characterising one or more unique DNA sequences derived from these clones and 
encoding the expressed product. Such DNA sequences will generally be novel as 
they have been selected from a highly diverse fuzzy-DNA pool. 
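
By way of illustration only, the third-base variation of element 1 can be sketched in Python. The sketch takes a parent coding sequence and, at each codon, picks at random among the codons that share the first two bases and encode the same amino acid, so that only the third base varies and both the reading frame and the amino acid sequence are retained. The function names, the example sequence and the seeded random generator are assumptions made for this illustration and are not part of the method as described.

```python
import random

BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TO_AA = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def split_codons(dna):
    """Split an in-frame coding sequence into codons."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def translate(dna):
    """Translate an in-frame coding sequence to one-letter amino acids."""
    return "".join(CODON_TO_AA[c] for c in split_codons(dna))


def third_base_variants(codon):
    """Codons with the same first two bases that encode the same residue."""
    residue = CODON_TO_AA[codon]
    return [codon[:2] + b for b in BASES if CODON_TO_AA[codon[:2] + b] == residue]


def fuzzy_copy(parent_dna, rng):
    """Return one member of the fuzzy pool derived from parent_dna."""
    return "".join(rng.choice(third_base_variants(c)) for c in split_codons(parent_dna))


# Hypothetical parent sequence encoding Met-Ala-Arg-Leu-Ser.
parent = "ATGGCTCGTTTATCA"
rng = random.Random(1)  # seeded only so that the sketch is repeatable
pool = [fuzzy_copy(parent, rng) for _ in range(5)]
for member in pool:
    print(member, translate(member))
```

Every member of the pool translates to the same peptide as the parent. Codons for methionine and tryptophan have no third-base alternative and are therefore left unchanged.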

In specific embodiments of the invention, the autonomously folding protein domain is a 
ligand for one or more cell surface receptors involved in the regulation of the immune 
system. One such example is human or murine C3d or C3d,g polypeptide sequence or 
another peptide ligand of CR2 (CD21) or CD19.

In a second embodiment, the repeated fuzzy or partially fuzzy domain may be an 
immunogen, particularly an antigenic protein or region of a protein. Examples of 
polypeptide immunogens include but are not restricted to: the Hepatitis B surface antigen,
meningococcal surface proteins, proteins expressed at various stages of the life cycle of
the malaria parasite, the glucan-binding region of streptococcal glucosyltransferases, the
haemagglutinin (H) and neuraminidase (N) proteins of influenza virus strains and the
D-repeat regions of the fibronectin binding proteins of staphylococci.

Optionally, an antigen oligomer may be fused to a C3d oligomer, either or both 
components being expressed from fuzzy or partially fuzzy genes.

In a third embodiment, the expressed oligomeric protein may be derivatised to facilitate 
post-translational linkage to an antigen or other protein. Preferably, such derivatisation is 
effected by engineering a reactive residue such as a free cysteine or a thiolester group at a
unique site in the oligomer, preferably at the C- or N- terminus. 

In a further aspect, the invention also provides for expression of closely related 
polypeptides in a single linear molecule by the ligation of fuzzy DNA sequences 
encoding near-identical amino acid sequences. In this context, near-identical signifies
sequences differing by at least one amino acid but not in more than 10% of the total 
number of amino acids. Examples of such constructs include but are not restricted to 
genes encoding several variations of a protein or antigen and concatenated 
immunoglobulin single-chain Fv fragments with a similar overall architecture but
containing small variations in the complementarity-determining regions so that they 
recognise different antigens. 
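
By way of illustration only, the definition of near-identical given above can be written as a simple test in Python. The sketch assumes that the two amino acid sequences are already aligned and of equal length; the treatment of insertions or deletions is not addressed in the text and is left out here. The function name and the example sequences are hypothetical.

```python
def is_near_identical(seq_a, seq_b, max_percent=10):
    """True if the sequences differ in at least one residue but in no more
    than max_percent of the total number of residues.

    Assumes pre-aligned sequences of equal length (an assumption of this
    sketch). Integer arithmetic avoids floating-point rounding at the limit.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    differences = sum(1 for x, y in zip(seq_a, seq_b) if x != y)
    return differences >= 1 and 100 * differences <= max_percent * len(seq_a)


# Hypothetical 20-residue sequences.
reference = "MKTAYIAKQRQISFVKSHFS"
two_changes = "MKTAYLAKQRQISFVKSHFA"    # 2 of 20 residues differ (10%)
three_changes = "MKTAYLAKQRQLSFVKSHFA"  # 3 of 20 residues differ (15%)

print(is_near_identical(reference, two_changes))
print(is_near_identical(reference, three_changes))
print(is_near_identical(reference, reference))
```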

Another embodiment of the invention utilises the novel DNA sequences identified by the 
selection process noted above as components of expression vectors for genetic (or DNA) 
immunisation. In this application, the preferred DNA sequences for expression in a given 
cell type (such as a human cell line) are identified by screening cells of that type 
transfected with a fuzzy or partially fuzzy DNA pool (within a suitable vector) for
expression of the desired construct. This application may be further extended by 
chemically linking the expressed, derivatised C3d (or other protein) oligomers to DNA- 
binding molecules such as cationic lipids, lipopeptides or liposomes so that vectors for 
DNA immunisation may be targeted to particular cell types. Thus, for example, a C3d 
trimer linked to a liposome containing DNA encoding a (C3d)3-antigen fusion could be 
targeted to dendritic cells. Expression of the construct by the dendritic cells could then 
present a targeted antigen locally to further B-lineage cells thus achieving a dual-level
selectivity. 

The invention provides a process for preparing oligomeric polypeptides according to the
invention which process comprises expressing DNA encoding said polypeptide in a 
recombinant host cell and recovering the product. That process may comprise the steps 
of: 

(i) preparing a variable (fuzzy) replicable expression vector capable, in a host cell, of 
expressing a DNA polymer comprising a nucleotide sequence that encodes said 
polypeptide; 

(ii) transforming a host cell with said vector;

(iii) culturing said transformed host cell under conditions permitting expression of said
DNA polymer to produce said polypeptide; and

(iv) recovering said polypeptide in an active form.

The fuzzy DNA polymer comprising a nucleotide sequence that encodes the polypeptide 
also forms part of the invention. 

The process of the invention may be performed using conventional recombinant 
techniques such as described in Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Edition, Cold Spring Harbor Laboratory Press (1989) and DNA Cloning
vols I, II and III (D. M. Glover ed., IRL Press Ltd). 

The invention also provides a process for preparing the DNA polymer by the 
condensation of appropriate mono-, di- or oligomeric nucleotide units. 

The preparation may be carried out chemically, enzymatically, or by a combination of the
two methods, in vitro or in vivo as appropriate. Thus, the DNA polymer may be prepared
by the enzymatic ligation of appropriate DNA fragments, by conventional methods such
as those described by D. M. Roberts et al., in Biochemistry 1985, 24, 5090-5098.

The DNA fragments may be obtained by digestion of DNA containing the required 
sequences of nucleotides with appropriate restriction enzymes, by chemical synthesis, by 
enzymatic polymerisation, or by a combination of these methods. 

Digestion with restriction enzymes may be performed in an appropriate buffer at a
temperature of 20°-70°C, generally in a volume of 50µl or less with 0.1-10µg DNA.
Enzymatic polymerisation of DNA may be carried out in vitro using a DNA polymerase
such as DNA polymerase I (Klenow fragment) in an appropriate buffer containing the
nucleoside triphosphates dATP, dCTP, dGTP and dTTP as required at a temperature of
10°-37°C, generally in a volume of 50µl or less. Enzymatic ligation of DNA fragments
may be carried out using a DNA ligase such as T4 DNA ligase in an appropriate buffer at
a temperature of 4°C to 37°C, generally in a volume of 50µl or less.

The chemical synthesis of the DNA polymer or fragments may be carried out by 
conventional phosphotriester, phosphite or phosphoramidite chemistry, using solid phase 
techniques such as those described in 'Chemical and Enzymatic Synthesis of Gene
Fragments - A Laboratory Manual' (ed. H.G. Gassen and A. Lang), Verlag Chemie,
Weinheim (1982), or in other scientific publications, for example M.J. Gait, H.W.D.
Matthes, M. Singh, B.S. Sproat and R.C. Titmas, Nucleic Acids Research, 1982, 10, 6243;
B.S. Sproat and W. Bannwarth, Tetrahedron Letters, 1983, 24, 5771; M.D. Matteucci
and M.H. Caruthers, Tetrahedron Letters, 1980, 21, 719; M.D. Matteucci and M.H.
Caruthers, Journal of the American Chemical Society, 1981, 103, 3185; S.P. Adams et
al., Journal of the American Chemical Society, 1983, 105, 661; N.D. Sinha, J. Biernat, J.
McMannus and H. Koester, Nucleic Acids Research, 1984, 12, 4539; and H.W.D.
Matthes et al., EMBO Journal, 1984, 3, 801. Preferably an automated DNA synthesiser
(for example, Applied Biosystems 381A Synthesiser) is employed.

The DNA polymer is preferably prepared by ligating two or more DNA molecules which
together comprise a DNA sequence encoding the polypeptide. The DNA molecules may 
be obtained by the digestion with suitable restriction enzymes of vectors carrying the 
required coding sequences. 



The precise structure of the DNA molecules and the way in which they are obtained 
depends upon the structure of the desired product. The DNA molecule encoding the 
polypeptide may be constructed using a variety of methods including chemical synthesis
of DNA oligonucleotides, enzymatic polymerisation, restriction enzyme digestion and 
ligation. The derivation of a suitable strategy for construction of the DNA molecule by 
such techniques is a routine matter for the skilled worker in the art. 

The systematic variation of third-base usage is described in more detail below (Example 
3, Table 1) and additional consideration may be given to the avoidance of rarely used 
codons of the particular host cell. 
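
By way of illustration only, the avoidance of rarely used codons can be combined with third-base variation as in the following Python sketch. For a given parent codon it lists the synonymous third-base alternatives and discards any that appear in a set of rare codons for the host. The set shown is a placeholder containing codons often reported as rare in E. coli and is an assumption of this sketch; in practice it would be replaced with a codon usage table for the particular host cell. The function name and the fallback to the parent codon are likewise illustrative.

```python
BASES = "TCAG"
# Standard genetic code listed in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TO_AA = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Placeholder set (assumption): codons often reported as rare in E. coli.
RARE_CODONS_EXAMPLE = {"AGA", "AGG", "CGA", "CTA", "ATA", "CCC"}


def allowed_third_base_variants(codon, rare_codons):
    """Synonymous third-base variants of codon, excluding rare codons.

    If every variant is rare, the parent codon is kept so that the amino
    acid sequence can still be encoded (a choice made for this sketch).
    """
    residue = CODON_TO_AA[codon]
    variants = [codon[:2] + b for b in BASES
                if CODON_TO_AA[codon[:2] + b] == residue]
    kept = [v for v in variants if v not in rare_codons]
    return kept or [codon]


print(allowed_third_base_variants("CTA", RARE_CODONS_EXAMPLE))
print(allowed_third_base_variants("AGA", RARE_CODONS_EXAMPLE))
```

Excluding rare codons narrows the fuzzy pool at the affected positions, so the gain in expression has to be weighed against the loss of sequence variability.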

The expression of the DNA polymer encoding the polypeptide in a recombinant host cell 
may be carried out by means of a replicable expression vector capable, in the host cell, of 
expressing the DNA polymer. The expression vector is novel and also forms part of the 
invention. 

The replicable expression vector may be prepared in accordance with the invention, by
cleaving a vector compatible with the host cell to provide a linear DNA segment having
an intact replicon, and combining said linear segment with one or more DNA molecules
which, together with said linear segment, encode the polypeptide, under ligating
conditions.

The ligation of the linear segment and more than one DNA molecule may be carried out
simultaneously or sequentially as desired. Thus, the DNA polymer may be preformed or
formed during the construction of the vector, as desired. The choice of vector will be
determined in part by the host cell, which may be prokaryotic, such as E. coli,
mammalian, such as mouse C127, mouse myeloma, Chinese hamster ovary, or another
eukaryote (e.g. filamentous fungi or unicellular yeast or an insect cell such as Drosophila
or Spodoptera). The host cell may also be in a transgenic animal. Suitable vectors
include plasmids, bacteriophages, cosmids and recombinant viruses derived from, for
example, baculoviruses or vaccinia.

The DNA polymer may be assembled into vectors designed for isolation of stable
transformed mammalian cell lines expressing the fragment e.g. bovine papillomavirus
vectors in mouse C127 cells, or amplified vectors in Chinese hamster ovary cells (DNA
Cloning Vol. II, D.M. Glover ed., IRL Press 1985; Kaufman, R.J. et al., Molecular and
Cellular Biology 5, 1750-1759, 1985; Pavlakis, G.N. and Hamer, D.H., Proceedings of
the National Academy of Sciences (USA) 80, 397-401, 1983; Goeddel, D.V. et al.,
European Patent Application No. 0093619, 1983).



The preparation of the replicable expression vector may be carried out conventionally 
with appropriate enzymes for restriction, polymerisation and ligation of the DNA, by 
procedures described in, for example, Sambrook et al., cited above. Polymerisation and
ligation may be performed as described above for the preparation of the DNA polymer.
Digestion with restriction enzymes may be performed in an appropriate buffer at a
temperature of 20°-70°C, generally in a volume of 50µl or less with 0.1-10µg DNA.

The recombinant host cell is prepared, in accordance with the invention, by transforming 
a host cell with a replicable expression vector of the invention under transforming
conditions. Suitable transforming conditions are conventional and are described in, for 
example, Sambrook et al., cited above, or "DNA Cloning" Vol. II, D.M. Glover ed., IRL 
Press Ltd, 1985. 

The choice of transforming conditions is determined by the host cell. Thus, a bacterial 
host such as E. coli may be treated with a solution of CaCl2 (Cohen et al., Proc. Nat.
Acad. Sci., 1973, 69, 2110) or with a solution comprising a mixture of RbCl, MnCl2,
potassium acetate and glycerol, and then with 3-[N-morpholino]-propane-sulphonic acid, 
RbCl and glycerol or by electroporation as for example described by Bio-Rad 
Laboratories, Richmond, California, USA, manufacturers of an electroporator. 
Eukaryotic cells in culture may be transformed by calcium co-precipitation of the vector 
DNA onto the cells or by using cationic liposomes. 

The invention also extends to a host cell transformed with a variable replicable
expression vector of the invention. 

Culturing the transformed host cell under conditions permitting expression of the DNA 
polymer is carried out conventionally, as described in, for example, Sambrook et al., and
"DNA Cloning" cited above. Thus, preferably the cell is supplied with nutrient and
cultured at a temperature below 45°C.

The protein product is recovered by conventional methods according to the host cell. 
Thus, where the host cell is bacterial such as E. coli and the protein is expressed 
intracellularly, it may be lysed physically, chemically or enzymatically and the protein
product isolated from the resulting lysate. Where the host cell is eukaryotic, the product
is usually isolated from the nutrient medium. 

Where the host cell is bacterial, such as E. coli, the product obtained from the culture 
may require folding for optimum functional activity. This is most likely if the protein is 
expressed as inclusion bodies. There are a number of aspects of the isolation and folding 
process that are regarded as important. In particular, the polypeptide is preferably 
partially purified before folding, in order to minimise formation of aggregates with 
contaminating proteins and minimise misfolding of the polypeptide. Thus, the removal 
of contaminating E. coli proteins by specifically isolating the inclusion bodies and the
subsequent additional purification prior to folding are important aspects of the procedure.

The folding process is carried out in such a way as to minimise aggregation of 
intermediate-folded states of the polypeptide. Thus, careful consideration needs to be 
given to, among others, the salt type and concentration, temperature, protein 
concentration, redox buffer concentrations and duration of folding. The exact condition 
for any given polypeptide generally cannot be predicted and must be determined by 
experiment. 



There are numerous methods available for the folding of proteins from inclusion bodies
and these are known to the skilled worker in this field. The methods generally involve 
breaking all the disulphide bonds in the inclusion body, for example with 50mM 2- 
mercaptoethanol, in the presence of a high concentration of denaturant such as 8M urea 
or 6M guanidine hydrochloride. The next step is to remove these agents to allow folding 
of the proteins to occur. Formation of the disulphide bridges requires an oxidising 
environment and this may be provided in a number of ways, for example by air, or by 
incorporating a suitable redox system, for example a mixture of reduced and oxidised 
glutathione. 

Preferably, the inclusion body is solubilised using 8M urea, in the presence of 
mercaptoethanol, and protein is folded, after initial removal of contaminating proteins, by 
addition of cold buffer. A preferred buffer is 60mM ethanolamine containing 1mM
reduced glutathione and 0.5mM oxidised glutathione. Ideally, the nature of the buffer is 
determined experimentally in order to obtain a protein product that is functionally active. 

The folding is preferably carried out at a temperature in the range 1 to 5°C over a period 
of 1 to 4 days. 

If any precipitation or aggregation is observed, the aggregated protein can be removed in 
a number of ways, for example by centrifugation or by treatment with precipitants such 
as ammonium sulphate. 

The polypeptide portion of the derivative of the invention may include a C-terminal 
cysteine to facilitate post-translational modification. Expression in a bacterial system is
preferred for some proteins of moderate size (up to ~70kDa) and with <~8 disulphide
bridges. More complex proteins for which a free terminal cysteine could cause refolding
or stability problems may require expression in eukaryotic cells.

The use of insect cells infected with recombinant baculovirus encoding the polypeptide 
portion is a preferred general method for preparing more complex proteins, particularly 
the C3d oligomers of the invention. 

A preferred method of handling proteins derivatised with cysteine is as a mixed
disulphide with mercaptoethanol or glutathione or as the 2-nitro, 5-carboxyphenyl thio-
derivative as generally described below.

Where the oligomeric polypeptide derivative of the invention includes a single cysteine, 
chemical ligation to a second polypeptide containing a unique cysteine may be employed. 
The bridge is generated by conventional disulphide exchange chemistry, by activating a 
thiol on one polypeptide and reacting the activated thiol with a free thiol on the other 
polypeptide. Such activation procedures make use of disulphides which form stable 
thiolate anions upon cleavage of the S-S linkage and include reagents such as 2,2'-
dithiopyridine and 5,5'-dithiobis-(2-nitrobenzoic acid) (DTNB) that form intermediate
mixed disulphides capable of further reaction with thiols to give stable disulphide
linkages. One polypeptide activated in this way is then reacted with the second
containing the free thiol. The precise conditions of pH, temperature, buffer and reaction
time will depend on the nature of the reagent used and the polypeptide to be modified. 
The polypeptide linkage reaction is preferably carried out by mixing the modified 
polypeptides in neutral buffer in an approximately equimolar ratio. The reaction should 
preferably be carried out under an atmosphere of nitrogen. Preferably, UV-active 
products are produced (e.g. from the release of pyridine 2-thione from 2-pyridyl dithio 
derivatives) so that coupling can be monitored. 
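
By way of illustration only, monitoring of the coupling by release of pyridine 2-thione can be reduced to a Beer-Lambert calculation, as in the Python sketch below. The molar absorption coefficient used (8080 per M per cm at 343 nm) is a commonly quoted literature value for pyridine 2-thione and is not taken from this text; the wavelength, path length, function names and example readings are likewise assumptions of the sketch.

```python
# Commonly quoted literature value for pyridine 2-thione at 343 nm
# (assumption of this sketch, not stated in the text).
EPSILON_PYRIDINE_2_THIONE = 8080.0  # M^-1 cm^-1


def released_thione_molar(delta_a343, path_cm=1.0,
                          epsilon=EPSILON_PYRIDINE_2_THIONE):
    """Concentration of released pyridine 2-thione from the rise in A343.

    Beer-Lambert law: A = epsilon * c * l, so c = A / (epsilon * l).
    """
    return delta_a343 / (epsilon * path_cm)


def fraction_coupled(delta_a343, activated_polypeptide_molar, path_cm=1.0):
    """Fraction of activated thiols that have reacted, assuming one mole of
    pyridine 2-thione is released per mole of disulphide formed."""
    return released_thione_molar(delta_a343, path_cm) / activated_polypeptide_molar


# Hypothetical readings: A343 rises by 0.404 in a reaction containing
# 100 micromolar activated polypeptide.
print(released_thione_molar(0.404))
print(fraction_coupled(0.404, 100e-6))
```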

After the linkage reaction, the polypeptide conjugate can be isolated by a number of 
chromatographic procedures such as gel filtration, ion-exchange chromatography, affinity 
chromatography or hydrophobic interaction chromatography. These procedures may be 
either low pressure or high performance variants. 

The conjugate may be characterised by a number of techniques including low pressure or 
high performance gel filtration, SDS polyacrylamide gel electrophoresis or isoelectric 
focussing. 

In a further aspect, therefore, the invention provides a process for preparing a derivative
according to the invention, which process comprises expressing DNA encoding the
oligomeric polypeptide portion of said derivative in a recombinant host cell and
recovering the product and thereafter post-translationally linking the polypeptide to a
derivatised antigen or other polypeptide.

GENERAL METHODS USED IN EXAMPLES

(i) DNA Cleavage

Cleavage of DNA by restriction endonucleases was carried out according to the
manufacturer's instructions using supplied buffers (New England Biolabs (U.K.) Ltd.,
Herts, or Promega Ltd., Hants, UK). Double digests were carried out simultaneously if
the buffer conditions were suitable for both enzymes. Otherwise double digests were
carried out sequentially where the enzyme requiring the lowest salt condition was added
first to the digest. Once the digest was complete the salt concentration was altered and
the second enzyme added.

(ii) DNA ligation

Ligations were carried out using T4 DNA ligase purchased from Promega or New
England Biolabs as described in Sambrook et al., (1989) Molecular Cloning: A
Laboratory Manual 2nd Edition, Cold Spring Harbor Laboratory Press.

(iii) Plasmid isolation

Plasmids were isolated using Wizard™ Plus Minipreps (Promega) or Qiex mini
or midi kits and Qiagen Plasmid Maxi kit (QIAGEN, Surrey) according to the
manufacturer's instructions.

Plasmid pSG.C3di.YL encoding C3d monomer and plasmid pSG.(C3d)3.YL
encoding C3d trimer were kindly provided by Professor D.T. Fearon, University of
Cambridge. All C3d constructs and their products derived from these plasmids are
terminated with the epitope tag Glu-Glu-Phe.

(iv) DNA fragment isolation 

DNA fragments were excised from agarose gels and DNA extracted using the 
QIAEX gel extraction kit or Qiaquick (QIAGEN, Surrey, UK), or GeneClean, or 
GeneClean Spin Kit or MERmaid Kit, or MERmaid Spin Kit (Bio 101 Inc, CA, USA) gel
extraction kits according to the manufacturer's instructions. 

(v) Introduction of DNA into E. coli 

Plasmids were transformed into competent E. coli BL21(DE3) or XL1-Blue
strains (Studier and Moffatt (1986), J. Mol. Biol. 189:113). The E. coli strains were
purchased as frozen competent cultures from Stratagene (Cambridge, UK).

(vi) DNA sequencing 

The sequences were analysed by a Perkin Elmer ABI Prism 373 DNA Sequencer. 
This is an electrophoretic technique using 36 cm x 0.2mm 4% acrylamide gels, the 
fluorescently labeled DNA fragments being detected by a charge coupled device camera 
according to the manufacturer's instructions. 



(vii) Production of oligonucleotides 

Oligonucleotides were purchased from Cruachem, Glasgow, UK.

(viii) Generation of baculovirus vectors 

Plasmids described in this invention having the prefix pBP (e.g. pBP68-01 
described below) are used to generate baculovirus vectors and express the encoded 
recombinant polypeptides by the following methods (Sections (viii) to (x)). 
Purified plasmid DNA was used to generate recombinant baculoviruses using the kit 'The
BacPak Baculovirus Expression System' according to the manufacturer's protocols
(Clontech, CA, USA). The insect cell line Sf9 (ATCC) was grown in IPL-41 medium
(Sigma, Dorset, UK) supplemented according to manufacturer's recommendations with
yeast extract, lipids and pluronic F68 (all from Sigma) and 1% (v/v) foetal calf serum
(Gibco, Paisley, UK) - this is termed growth medium. Cells were transfected with the
linearised baculovirus DNA (supplied in the kit) and the purified plasmid. Plaque assays
(see method below) were carried out on culture supernatants and a series of ten-fold
dilutions thereof to allow isolation of single plaques. Plaques were picked using glass 
Pasteur pipettes and transferred into 0.5ml aliquots of growth medium. This is the 
primary seed stock. 

(ix) Plaque assay of baculoviruses 

1x10^6 Sf9 cells were seeded as monolayer cultures in 30mm plates and left to
attach for at least 30 minutes. The medium was poured off and virus inoculum in 100µl
growth medium was dripped onto the surface of the monolayer. The plates were
incubated for 30 minutes at room temperature, occasionally tilting the plates to prevent
the monolayer from drying out. The monolayer was overlaid with a mixture of 1ml
growth medium and 3% (w/v) "Seaplaque" agarose (FMC, ME) warmed to 37°C and
gently swirled to mix in the inoculum. Once set a liquid overlay of 1ml growth medium
was applied. The plates were incubated in a humid environment for 3-5 days.

Visualisation of plaques was achieved by addition to the liquid overlay of 1ml
phosphate buffered saline (PBS) containing neutral red solution at 0.1% (w/v) from a
stock solution of 1% (w/v) (Sigma, Dorset,UK). Plaques were visible as circular regions 
devoid of stain up to 3mm in diameter. 
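
By way of illustration only, the virus titre follows from a plaque count by the usual arithmetic: plaques counted, multiplied by the fold dilution of the inoculum, divided by the inoculum volume. The Python sketch below applies this to the 100µl inoculum used in the assay above. The function name and the plaque count and dilution in the example are hypothetical.

```python
def titre_pfu_per_ml(plaque_count, dilution_fold, inoculum_ul=100):
    """Virus titre in plaque-forming units (pfu) per ml.

    plaque_count  : plaques counted on the plate
    dilution_fold : fold dilution of the stock plated (e.g. 10**5 for 10^-5)
    inoculum_ul   : volume of diluted virus applied, in microlitres
    """
    if inoculum_ul <= 0:
        raise ValueError("inoculum volume must be positive")
    # 1000 converts the per-microlitre figure to per-millilitre.
    return plaque_count * dilution_fold * 1000 / inoculum_ul


# Hypothetical example: 42 plaques on the plate inoculated with 100 microlitres
# of the 10^-5 dilution.
print(titre_pfu_per_ml(42, 10**5))
```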

(x) Scale-up of baculovirus vectors and protein expression 

200µl of the primary seed stock was used to infect 1x10^6 Sf9 monolayer cell
cultures in 30mm plates. The seed stock was dripped onto the monolayer and incubated
for 20 minutes at room temperature, and then overlaid with 1ml growth medium. The
plates were incubated at 27°C in a humid environment for 3-5 days. The supernatant
from these cultures is Passage 1 virus stock. The virus titre was determined by plaque
assay and further scale up was achieved by infection of monolayer cultures or suspension
cultures at a multiplicity of infection (moi) of 0.1. Virus stocks were passaged a
maximum of six times to minimise the emergence of defective virus. 

Expression of recombinant proteins was achieved by infection of monolayer or 
suspension cultures in growth medium with or without foetal calf serum (FCS). Where 
FCS was omitted cells conditioned to growth in the absence of FCS were used. Virus 
stocks between passage 1 and 6 were used to infect cultures at a moi of >5 per cell. 
Typically, infected cultures were harvested 72 hours post infection and recombinant 
proteins isolated either from the supernatants or the cells. 
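
By way of illustration only, the volume of virus stock needed to reach a chosen multiplicity of infection is the moi multiplied by the number of cells and divided by the titre of the stock. The Python sketch below performs this calculation. The function name, the cell number and the titre in the example are hypothetical; only the moi values of 0.1 (scale-up) and 5 (expression) come from the text.

```python
def inoculum_volume_ml(cell_count, moi, titre_pfu_per_ml):
    """Volume of virus stock (ml) that delivers `moi` plaque-forming units
    per cell to `cell_count` cells."""
    if titre_pfu_per_ml <= 0:
        raise ValueError("titre must be positive")
    return moi * cell_count / titre_pfu_per_ml


# Hypothetical culture of 2 x 10^6 cells and a stock of 1 x 10^8 pfu/ml.
cells = 2_000_000
stock_titre = 1e8

print(inoculum_volume_ml(cells, 0.1, stock_titre))  # scale-up of virus stock
print(inoculum_volume_ml(cells, 5, stock_titre))    # protein expression
```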

(xi) Selection of stable variants of (C3d)3 or other concatenated protein domains 
expressed in insect cells using baculovirus vectors 

Baculovirus transfer vector plasmids encoding uncharacterised C3d (or other 
protein domain) concatamers containing one or more fuzzy domains are transformed into 
competent E. coli XL1-Blue strain according to Method (v). The resulting colonies each
contain a single isolate which may be comprised of one or more fuzzy domains and may
also contain one copy of the domain with the original DNA sequence derived from the
parent plasmid (in the case of C3d, pSG.C3dYL). Individual colonies are scaled up and
DNA extracted according to Method (iii) (miniprep method). Aliquots of DNA are
cotransfected into insect cells with BacPak6 linear DNA using a modification of the
manufacturer's protocol: 2 x 10^5 Sf9 cells are seeded into flat-bottomed microtitre plates
and allowed to attach for 30 minutes in serum free medium. If cells are grown in
medium containing serum the monolayer is washed three times in serum free medium.
The volume of medium in each well is adjusted to a standard volume (typically 50-
150µl).

For each plasmid isolate, 0.5µl of miniprep plasmid DNA (at 
concentrations ranging from 0.5 to 5µg/µl) is mixed with 0.1µl to 2µl of BacPak6 DNA 
(Clontech) (at the concentration supplied) and 5-25µl of 50µg/ml Lipofectin reagent (Life 
Technologies Inc.) and incubated at room temperature for 15 minutes to allow 
Lipofectin/DNA complexes to form. The mixture is dripped onto the surface of the 
medium in a single microtitre well and swirled gently with the pipette tip. A single 
microtitre dish may contain transfections of up to 96 plasmid isolates, one well per 
isolate. The transfections are incubated in a humid environment at 27°C for five hours. 

The medium containing the Lipofectin/DNA is then removed from the monolayer 
and replaced with 100µl medium which may contain 1% FCS unless this is considered 
likely to interfere with subsequent assays. The plate is incubated in a humid 
environment at 27°C for three to five days. After three days the plate is visually inspected 
for evidence of baculovirus infection. When a significant number of wells show signs of 
infection (retarded cell growth and enlarged or dumbbell-shaped cells) aliquots of culture 
supernatant are harvested from each well and stored under sterile conditions for subsequent 
expansion. Further aliquots of culture supernatant or cells are subjected to biochemical 
or biological assay to determine the presence or absence of protein domain monomers 
and oligomers. For example, 20µl aliquots of supernatant may be subjected to 
SDS-PAGE and/or Western blotting (see Methods Section (xv) and (xvi)). 

Selected clones showing significant yields of oligomers are scaled up from stored 
supernatants according to Method (x). Plasmids corresponding to the selected clones are 
also scaled up for large-scale plasmid DNA extraction according to Method (iii) and 
sequenced according to Method (vi). 

(xii) pBROC413 

The plasmid pT7-7 [Tabor, S. (1990), Current Protocols in Molecular Biology, F. 
M. Ausubel, R. Brent, R. E. Kingston, D. D. Moore, J. G. Seidman, J. A. Smith, and K. 
Struhl (eds), pp. 16.2.1-16.2.11, Greene Publishing and Wiley-Interscience, New York] 
contains DNA corresponding to nucleotides 2065-4362 of pBR322 and, like pBR322, can 
be mobilised by a conjugative plasmid in the presence of a third plasmid, ColK. A 
mobility protein encoded by ColK acts on the nic site at nucleotide 2254 of pBR322, 
initiating mobilization from this point. pT7-7 was digested with LspI and BglII and the 
protruding 5' ends filled in with the Klenow fragment of DNA Polymerase I. The 
plasmid DNA fragment was purified by agarose gel electrophoresis, the blunt ends 
ligated together and transformed into E. coli DH1 by electroporation using a Bio-Rad 
Gene Pulser and following the manufacturer's recommended conditions. The resultant 
plasmid pBROC413 was identified by restriction enzyme analysis of plasmid DNA. 

The deletion in pBROC413 from the LspI site immediately upstream of the φ10 
promoter to the BglII site at nucleotide 434 of pT7-7 deletes the DNA corresponding to 
nucleotides 2065-2297 of pBR322. The nic site and adjacent sequences are therefore 
deleted, making pBROC413 non-mobilizable. 

(xiii) pDB1013 

The construction of this plasmid, a derivative of pBROC413, is fully described in Dodd 
et al (1995) Protein Expression and Purification 6: 727-736. 

(xiv) Protein Purification 

A number of standard chromatographic techniques can be used to isolate protein 
domain oligomers and fusions such as (C3d)3. For example, ion-exchange 
chromatography and hydrophobic interaction chromatography, with the appropriate 
buffer systems and gradients, can be used to purify the target proteins. The chromatographic 
properties of the C3d or other domain-containing fusion polypeptides will vary 
depending on the nature of the fusion protein. 

C3d molecules and derivatives constructed to contain the C-terminal affinity tag 
Glu-Glu-Phe can be purified by affinity chromatography using the YL1/2 antibody 
coupled to Sepharose 4B as described by Dempsey et al (WO 96/17625; 
PCT/GB95/02851). 

(xv) Sodium Dodecyl Sulphate Polyacrylamide Gel Electrophoresis (SDS-PAGE) 

SDS-PAGE was carried out generally using the Novex system (Novex GmbH, 
Heidelberg) according to the manufacturer's instructions. Pre-packed gels containing 
a 4-20% acrylamide gradient were used. Samples for electrophoresis, including protein 
molecular weight standards (for example LMW Kit, Pharmacia, Sweden or Novex Mark 
12, Novex, Germany) were usually diluted 1:1 in 2% (w/v) SDS-containing buffer (with 
or without 5% (v/v) 2-mercaptoethanol), and left at room temperature for 0.5h before 
application to the gel. 

(xvi) Immunoblotting 

(a) Dot blot 

Immobilon membranes (Millipore, Middlesex, UK) were activated by immersion in 
methanol for 20 seconds and then washed in PBS for five minutes. The membrane was 
placed into a vacuum manifold Dot Blotter (Bio-Rad Laboratories, Watford, UK). Crude 
extracts from cells or culture supernatants were transferred onto the membrane by 
applying a vacuum and washed through with PBS. Without allowing the membrane to 
dry out, the Dot Blotter was dismantled and the membrane removed. 

(b) Western Blotting 

Samples of cell extracts and purified proteins were separated on SDS-PAGE as 
described in Section (xv). The Immobilon membrane was prepared for use as in (a) 
above. The gel and the membrane were assembled in the Semi-Dry Transfer Cell 
(Trans-Blot SD, Bio-Rad Laboratories) with the Immobilon membrane towards the anode 
and the SDS-PAGE on the cathode side. Between the cathode and the gel were placed 3 
sheets of Whatman 3M filter paper cut to the size of the gel pre-soaked in a solution of 
192mM 6-amino-n-caproic acid, 25mM Tris pH 9.4 containing 10% (v/v) methanol. 
Between the anode and the membrane were placed two sheets of Whatman 3M filter 
paper cut to the size of the gel and soaked in 0.3 M Tris pH 10.4 containing 10% (v/v) 
methanol next to the anode and on this was laid a further sheet of Whatman 3M filter 
paper pre-soaked in 25mM Tris pH 10.4 containing 10% (v/v) methanol. 

The complete gel assembly was constructed so as to exclude air 
pockets. The proteins were transferred from the SDS-PAGE gel to the Immobilon 
membrane by passing 200mA current through the assembly for 30 minutes. 

(c) Immunoprobing of Dot Blot and Western Membranes 

The membranes were blocked by incubating the membrane for 1h at room 
temperature in 50ml of 10mM phosphate buffer pH 7.4 containing 150mM NaCl, 0.02% 
(w/v) Ficoll 400, 0.02% (w/v) polyvinylpyrrolidone and 0.1% (w/v) bovine serum albumin 
(BSA). The appropriate primary antibody was diluted to its working concentration in 
antibody diluent, 20mM sodium phosphate buffer pH 7.4 containing 0.3M NaCl, 0.5% 
(v/v) Tween-80 and 1.0% (w/v) BSA. The membrane was incubated for 2h at room 
temperature in 50ml of this solution and subsequently washed three times for 2 minutes 
in washing buffer, 20mM sodium phosphate pH 7.4 containing 0.3M NaCl and 0.5% 
(v/v) Tween-80. The membrane was then transferred to 50ml of antibody diluent buffer 
containing a suitable dilution of the species-specific antibody labelled with the 
appropriate label (e.g. biotin or horseradish peroxidase (HRP)) for the development process 
chosen, and incubated for 2h at room temperature. The membrane was then washed in 
washing buffer as described above. Finally, the blot was developed according to the 
manufacturer's instructions. 

The appropriate dilution of antibody for both the primary and secondary 
antibodies refers to the dilution that minimises unwanted background noise without 
affecting detection of the chosen antigen using the development system chosen. This 
dilution is determined empirically for each antibody. 

(xvii) Measurement of Biological Activity 

The biological function of C3d monomer produced in baculovirus can be tested 
for its ability to bind to its receptor, complement receptor-2 (CR2). As C3d is a product 
of the process of complement activation and subsequent degradation of C3b by the serum 
protease Factor I, it was also of interest to test C3d for its possible effect on complement 
activation using a classical pathway haemolytic assay. 

(a) Competitive Binding Assay of C3d to Raji Cells 

In this assay the ability of a new construct expressing at least one unit of C3d to 
compete with a control 125I-HEL-C3d for CR2 binding sites on the surface of Raji cells, a 
B-lymphoblastoid cell line, is assessed. Raji cells, 5x10^7 to 7x10^7 cells/ml, are incubated 
for 1h at 4°C with 1nM 125I-HEL-C3d and incremental concentrations of the C3d-containing 
molecule of interest. The cells are then centrifuged through a dibutyl-diiso-octyl-phthalate 
cushion, and the amount of radioactivity associated with the pellet determined. 
From these data the amount of the C3d-containing protein under test necessary to produce 
a 50% reduction in binding of 1nM 125I-HEL-C3d can be determined. 

(b) Competitive ELISA demonstrating C3d Binding to CR2 

C3d is used to coat 96-well microtitre plates at 5µg/ml in 0.1M NaHCO3 by incubating 
overnight at 4°C. The plates are then blocked in 1% BSA, 0.1% Tween-20 in PBS for 
1h at room temperature and subsequently incubated with either 25µl/well of C3d or 
variable test volumes of the C3d-containing molecule of interest to generate a dose 
response. To the wells is added 25µl/well of a sub-saturating concentration of co-ligated 
CR2.IgG1 at 125pM and incubated for 1h at room temperature. The plates are then 
washed three times with PBS containing 0.1% (v/v) Tween-20. The amount of CR2 
bound to the immobilised C3d is detected using a 1:3000 dilution of HRP-labelled goat 
anti-mouse-IgG1 antibody in PBS containing 0.1% (v/v) Tween-20, incubated for 1h 
at room temperature; the wells are then washed as described above. The presence of 
HRP-antibody is detected using the TMB Peroxidase EIA Substrate Kit according to the 
manufacturer's instructions (Bio-Rad, UK). 

(c) Anti-complement Activity Measured by the Haemolysis of Sheep Erythrocytes 

Functional activity of complement inhibitors is assessed by measuring the inhibition 
of complement-mediated lysis of sheep erythrocytes sensitised with rabbit antibodies 
(Diamedix Corporation, FL, USA). Human serum diluted 1:125 or 1:100 in 0.1M 
HEPES/0.15M NaCl/0.1% gelatin pH 7.4 was used as a source of complement. The 
serum was pooled from volunteer blood donations essentially as described in J.V. Dacie & 
S.M. Lewis, Practical Haematology, Churchill Livingstone, Edinburgh, 1975. Briefly, 
blood (10-20ml) was warmed to 37°C for 5 minutes, the clot removed and the remaining 
serum clarified by centrifugation. The serum fraction was split into small aliquots and 
stored at -196°C. Aliquots were thawed as required and diluted in the HEPES buffer 
immediately before use. 

Inhibition of complement-mediated lysis of sensitised sheep erythrocytes is 
measured using a standard haemolytic assay in a v-bottom microtitre plate format as 
follows: 
50µl of a range of concentrations of inhibitor (typically in the region of 0.1-100 
nM) diluted in HEPES buffer are mixed with 50µl of the diluted serum and 100µl of 
sensitised sheep erythrocytes and then incubated for 1 hour at 37°C. Samples are spun 
at 1600rpm at ambient temperature for 3 minutes before transferring 150µl of supernatant 
to flat bottom microtitre plates and determining the absorption at 410nm. Maximum 
lysis (Amax) is determined by incubating serum with erythrocytes in the absence of any 
inhibitor. Background lysis (Ao) is determined by incubating erythrocytes in the 
absence of any serum or inhibitor. To check whether the inhibitor itself had any effect on 
lysis, erythrocytes were incubated with inhibitor alone; none of the compounds had any 
direct effect on lysis of the erythrocytes. Inhibition is expressed as a fraction of the total 
cell lysis such that IH50 represents the concentration of inhibitor required to give 50% 
inhibition of lysis. 

IH = (A - Ao) / (Amax - Ao) 

where 0 is equivalent to complete inhibition and 1 equals no inhibition. 
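
The calculation of IH and the estimation of IH50 can be sketched in Python as follows. The absorbance readings in the example are hypothetical and are not data from this specification, and the linear interpolation between the two bracketing concentrations is an assumption made for simplicity (a fitted dose-response curve could equally be used). The function names are illustrative.

```python
def ih_fraction(a, a0, amax):
    """IH = (A - Ao) / (Amax - Ao); 0 = complete inhibition, 1 = no inhibition."""
    if amax <= a0:
        raise ValueError("Amax must exceed Ao")
    return (a - a0) / (amax - a0)


def ih50(concentrations, ih_values):
    """Estimate the concentration at which IH crosses 0.5 by linear
    interpolation between the two bracketing data points (an assumption).
    Returns None if the data never cross 0.5."""
    points = sorted(zip(concentrations, ih_values))
    for (c_lo, ih_lo), (c_hi, ih_hi) in zip(points, points[1:]):
        crosses = (ih_lo - 0.5) * (ih_hi - 0.5) <= 0
        if crosses and ih_lo != ih_hi:
            return c_lo + (0.5 - ih_lo) * (c_hi - c_lo) / (ih_hi - ih_lo)
    return None


# Hypothetical A410 readings, for illustration only
a0, amax = 0.05, 1.05                      # background and maximum lysis
concs_nM = [10.0, 100.0, 1000.0]           # inhibitor concentrations
readings = [0.95, 0.75, 0.35]              # absorbance with inhibitor
ih = [ih_fraction(a, a0, amax) for a in readings]
print([round(x, 2) for x in ih])
print(ih50(concs_nM, ih))
```

With these illustrative readings the IH values are 0.9, 0.7 and 0.3, and the interpolated IH50 falls at about 550 nM.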

(xviii) Reduction of disulphides and modification of thiols in proteins 

A number of methods can be used to reduce disulphides and modify thiols in proteins. 
Selective reduction of disulphides may be necessary because, during the 
isolation and purification of multi-thiol proteins, in particular during refolding of fully 
denatured multi-thiol proteins, inappropriate disulphide pairing can occur. In addition, 
even if correct disulphide pairing does occur, it is possible that a free cysteine in the 
protein may become blocked, for example with glutathione. These derivatives are 
generally quite stable. In order to make them more reactive, for example for subsequent 
conjugation to another functional group, they need to be selectively reduced, with for 
example dithiothreitol (DTT) or with Tris (2-carboxyethyl) phosphine.HCl (TCEP), then 
optionally modified with a function which is moderately unstable. An example of the 
latter is Ellman's reagent (DTNB) which gives a mixed disulphide. In the case where 
treatment with DTNB is omitted, careful attention to experimental design is necessary to 
ensure that dimerisation of the free thiol-containing protein is minimised. Reference to 
the term 'selectively reduced' above means that reaction conditions, e.g. duration, 
temperature and molar ratios of reactants, have to be carefully controlled so that disulphide 
bridges within the natural architecture of the protein are not reduced. All the reagents 
are commercially available, e.g. from Sigma (Poole, Dorset) or Pierce & Warriner 
(Chester, Cheshire). 

The following general examples illustrate the type of conditions that may be used 
and that are useful for the generation of free thiols and their optional modification. The 
specific reaction conditions to achieve optimal thiol reduction and/or modification are 
ideally determined for each protein batch. 

TCEP may be prepared as a 20mM solution in 50mM HEPES (approx. pH 4.5) 
and may be stored at -40°C. DTT may be prepared at 10mM in sodium phosphate pH 
7.0 and may be stored at -40°C. DTNB may be prepared at 10mM in sodium phosphate 
pH 7.0 and may be stored at -40°C. All of the above reagents are typically used at molar 
equivalence or molar excess, the precise concentrations ideally identified experimentally. 
The duration and the temperature of the reaction are similarly determined experimentally. 
Generally the duration would be in the range 1 to 24 hours and the temperature would be 
in the range 2 to 30°C. Excess reagent may be conveniently removed by buffer 
exchange, for example using Sephadex G25. A suitable buffer is 0.1M sodium 
phosphate pH 7.0. 
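
As an illustration of working at molar equivalence or molar excess, the volume of reagent stock required for a given molar ratio can be computed as in the Python sketch below. The protein concentration, sample volume and three-fold molar ratio are hypothetical values chosen for the example; the stock concentration (20mM TCEP) is taken from the text, and the molecular weight used is the 34279 Da monomer mass reported in Example 2. The function name is illustrative.

```python
def reagent_volume_ul(protein_mg_per_ml, protein_volume_ml,
                      protein_mw_da, stock_mM, molar_ratio):
    """Volume (µl) of reagent stock giving `molar_ratio` moles of reagent
    per mole of protein."""
    # mg divided by g/mol gives mmol; multiply by 1000 for µmol
    protein_umol = protein_mg_per_ml * protein_volume_ml / protein_mw_da * 1000.0
    reagent_umol = protein_umol * molar_ratio
    # a stock of X mM contains X µmol per ml
    volume_ml = reagent_umol / stock_mM
    return volume_ml * 1000.0


# Hypothetical sample: 1 ml of protein at 1 mg/ml, three-fold excess of TCEP
volume = reagent_volume_ul(
    protein_mg_per_ml=1.0,
    protein_volume_ml=1.0,
    protein_mw_da=34279.0,   # C3d monomer mass reported in Example 2
    stock_mM=20.0,           # TCEP stock concentration from the text
    molar_ratio=3.0,         # illustrative value, to be optimised per batch
)
print(f"{volume:.2f} µl of 20mM TCEP")
```

For this hypothetical sample the result is about 4.4µl of stock; in practice the ratio would be determined experimentally for each protein batch as described above.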



EXAMPLES 

Example 1. Expression and isolation of monomeric murine C3d [(C3d)1] (SEQ ID 
No. 26) and trimeric C3d [(C3d)3] (SEQ ID No. 27) in Escherichia coli: Evidence 
for instability of the trimeric construct in a bacterial host. 

(a) Construction of plasmid pDB1033 encoding (C3d)1 

pDB1013 was digested with SpeI / NdeI and the vector fragment was isolated from 
a 1.1% (w/v) agarose gel and purified using a Qiagen gel extraction kit. 
The vector termed pSG.C3d1.YL described in patent WO 96/17625 encodes one 
copy of the C3d coding sequence preceded by a signal peptide derived from dog 
pre-pro-insulin and is C-terminally tagged by the tripeptide epitope (Glu-Glu-Phe) recognised by 
the commercially available YL1/2 antibody (Serotec, Oxon). The vector was cut with 
BglII and XbaI; the digest was run on a 1.1% (w/v) agarose gel and the 1000 base pair 
BglII / XbaI fragment isolated. 

These two fragments were ligated with the BglII / NdeI linkers (SEQ ID Nos 28 
and 29) and transformed into competent E. coli JM109 cells under standard conditions 
and clones isolated. 



(b) Expression of (C3d)1 from pDB1033 

pDB1033 was transformed into calcium chloride competent E. coli BL21(DE3) 
and resultant colonies were isolated and checked for plasmid content. To express protein 
from pDB1033 in E. coli BL21(DE3), a single colony was inoculated into 10ml LB 
media (20g/l tryptone, 15g/l yeast extract, 0.8g/l NaCl, 0.2g/l Na2HPO4, 0.1g/l KH2PO4) 
containing 50µg/ml ampicillin (Sigma). The culture was grown for 6 hours at 37°C, 
260rpm before being used to inoculate 100ml of the same media containing 50µg/ml 
ampicillin. Growth was under the same conditions overnight. 25ml of each culture was 
then used to inoculate 3 x 500ml of the same media with 50µg/ml ampicillin in 3l 
Erlenmeyer flasks. Cells were grown to an OD of about 1.2 at A600nm. IPTG 
(isopropyl β-D-thiogalactopyranoside) was added to a final concentration of 1mM and cells 
incubated for a further 3.5 hours approximately before harvesting by centrifugation at 8000g for 10 
minutes. The cell pellet from 1.5l of culture was processed immediately. 

(c) Isolation of (C3d)1 

The methods described are essentially those detailed for the isolation of SCR1-3 
of Complement Receptor 1 in Dodd et al (1995) Protein Expression and Purification 6: 
727-736, with some modifications. 



(i) Isolation of solubilised inclusion bodies 

The cell pellet of E. coli BL21(DE3) (pDB1033) was resuspended in 50mM Tris/ 
50mM NaCl/1mM EDTA pH 8.0 (approximately 150ml). The suspension was 
transferred to a glass beaker surrounded by ice and sonicated (Heat systems - Ultrasonics 
W380; 50 x 50% pulse, pulse time = 5 seconds) for 3 minutes and then spun at 15000g 
for 10 minutes. The supernatant was decanted and discarded. The inclusion body pellet 
was stored at -40°C for 7 days. It was then resuspended in 20mM Tris/8M urea/1mM 
EDTA/50mM 2-mercaptoethanol pH 8.5 (120ml) at room temperature by vigorous 
swirling, then left for 1 hour at room temperature with occasional swirling. 

(ii) Initial purification using Macroprep High Q 

To the viscous solution was added Macroprep High Q (Bio-Rad; 24g wet 
weight) that had been water washed and suction-dried. The mixture was swirled 
vigorously and left static for 1-2h at room temperature. The supernatant was decanted, 
sampled and discarded. The remaining slurry was resuspended to a uniform suspension 
and poured into a glass jacket and allowed to settle into a packed bed. The column was 
equilibrated with 0.02M Tris/8M urea/0.05M 2-mercaptoethanol/0.001M EDTA pH 8.5 
at room temperature. When the A280 of the eluate had stabilised at baseline, the buffer 
was changed to equilibration buffer additionally containing 1M NaCl. A single A280 
peak was eluted by the 1M NaCl-containing buffer; the volume was approximately 80ml. 
SDS-PAGE under reducing conditions showed the product contained one major species, 
with a molecular weight around 35000. The solution was stored at -40°C. 



(d) Construction of plasmid pDB1032 encoding (C3d)3 

pDB1013 was digested with SpeI / NdeI and the vector fragment isolated from a 
1.1% (w/v) agarose gel and purified using a Qiagen gel extraction kit. 

The vector termed pSG.(C3d)3.YL described in patent WO 96/17625 encodes 
three identical copies of the C3d coding sequence preceded by a signal peptide derived 
from dog pre-pro-insulin. The C3d domains are separated by a linker (Gly4-Ser)2, and 
there is a tripeptide epitope (Glu-Glu-Phe) at the C-terminus of the final C3d domain that is 
recognised by the commercially available YL1/2 antibody (Serotec, Oxon.). The vector 
was cut with BglII and XbaI; the digest was run on a 1.1% (w/v) agarose gel and the 3000 
base pair BglII / XbaI fragment isolated. These two fragments were ligated with the 
BglII / NdeI linkers (SEQ ID Nos 28 and 29) and transformed into competent E. coli 
JM109 cells under standard conditions and clones containing pDB1032 were isolated. 

(b) Expression of (C3d)3 from pDB1032 

pDB1032 was transformed into calcium chloride competent E. coli BL21(DE3) 
and resultant colonies were isolated and checked for plasmid content. To express protein 
from pDB1032 in E. coli BL21(DE3), a single colony was inoculated into 10ml 
LB-phosphate (20g/l tryptone, 15g/l yeast extract, 0.8g/l NaCl, 0.2g/l Na2HPO4, 0.1g/l 
KH2PO4) containing 50µg/ml ampicillin. The culture was grown overnight at 37°C, 
230rpm, before being used to inoculate 50ml of the same medium containing 50µg/ml 
ampicillin. Growth was under the same conditions overnight. 15ml of each culture 
was then used to inoculate 3 x 600ml of the same medium with 50µg/ml ampicillin in 3l 
Erlenmeyer flasks. Cells were grown to an OD of about 1.1 at A600nm. IPTG was 
added to a final concentration of 1mM and cells allowed to continue growth for a further 
3.5 hours approximately before harvesting by centrifugation at 8000g for 10 minutes. The pellet from 1.5l 
of culture was processed immediately. 

(c) Isolation of (C3d)3 

The methods described are essentially those detailed for the isolation of SCR1-3 
of Complement Receptor 1 in Dodd et al (1995) Protein Expression and Purification 6: 
727-736, with some modifications. 

(i) Isolation of solubilised inclusion bodies 

The cell pellet of E. coli BL21(DE3) (pDB1032) was resuspended in 50mM 
Tris/50mM NaCl/1mM EDTA pH 8.0/0.1mM PMSF (approx. 100ml). The suspension 
was transferred to a glass beaker surrounded by ice and sonicated (Heat systems - 
Ultrasonics W380; 50 x 50% pulse, pulse time = 5 seconds) for 3 minutes and then spun 
at 15000g for 10 minutes. The supernatant was decanted and discarded. The inclusion 
body pellet was stored at -40°C for 3 days. It was then resuspended in 20mM Tris/8M 
urea/1mM EDTA/50mM 2-mercaptoethanol pH 8.5 (120ml) at room temperature by 
vigorous swirling, then left for 2h at room temperature with occasional swirling. 

(ii) Initial purification using Macroprep High Q 

To the viscous solution was added Macroprep High Q (Bio-Rad; 24g wet weight) 
that had been water washed and suction-dried. The mixture was swirled vigorously and 
left static for 1-2h at room temperature. The supernatant was decanted, sampled and 
discarded. The remaining slurry was resuspended to a uniform suspension and poured 
into a glass jacket and allowed to settle into a packed bed. The column was equilibrated 
with 0.02M Tris/8M urea/0.05M 2-mercaptoethanol/0.001M EDTA pH 8.5 at room 
temperature. When the A280 of the eluate had stabilised at baseline, the buffer was 
changed to equilibration buffer additionally containing 1M NaCl. A single A280 peak 
was eluted by the 1M NaCl-containing buffer; the volume was approximately 80ml. 
SDS-PAGE under reducing conditions showed that the product contained one major 
species, with a molecular weight around 35000, and additional species with molecular 
weights close to the target molecular weight of around 105000 and believed to be the 
target species. The solution was stored at -40°C. 

These results show that expression in a bacterial host of either monomeric or trimeric 
C3d genes gives rise to substantially the same product, the monomeric protein. The 
origins of this instability were not determined in this experiment. 

Example 2. Expression and isolation of C3d oligomers expressed in insect cells 
using baculovirus vectors: Evidence for homologous recombination as the origin of 
the low yield of trimer and the high yield of a monomeric C3d form. 

(a) Construction of plasmid pBP68-01 encoding (C3d)3-EEF 

The vector termed pSG.(C3d)3.YL, described in patent WO 96/17625, encodes 
three identical copies of the C3d coding sequence preceded by a signal peptide derived 
from dog pre-pro-insulin. The C3d domains are separated by a linker (Gly4-Ser)2, and 
there is a tripeptide epitope (Glu-Glu-Phe) at the C-terminus of the final C3d domain that is 
recognised by the commercially available YL1/2 antibody (Serotec, Oxon.). 

pSG.(C3d)3.YL was digested with restriction endonucleases EcoRI and XbaI and 
the 3075 base pair band was isolated from a 1% (w/v) agarose gel using a GeneClean 
extraction kit according to the manufacturer's instructions. The baculovirus expression 
vector pBacPak8 (Clontech) was digested with restriction endonucleases EcoRI and XbaI 
and the 5.6k base pair band was isolated from a 0.7% (w/v) agarose gel. 

The two fragments were ligated with T4 DNA ligase to give pBP68-01. The 
ligated plasmid was transformed into competent E. coli XL1-blue purchased from 
Stratagene. Ampicillin resistant transformants were selected on LB-agar plates 
containing 50µg/ml ampicillin. Resulting colonies were analysed by restriction 
endonuclease digestion. Plasmid DNA was extracted from E. coli containing pBP68-01 
using Qiaex mini or midi prep kits according to manufacturer's protocols. 

(b) Generation of baculovirus vBP68-01 for expression of C3d oligomers 

Purified pBP68-01 plasmid DNA was used to generate a recombinant baculovirus 
expression vector for (C3d)3-YL as described in General Methodology Section (viii). 
Resulting plaques were picked and used to generate primary seed stocks. Recombinant 
baculoviruses expressing C3d oligomers were identified by immuno-dot blot of culture 
supernatants of Passage 1 virus stocks, probing with an antibody to human C3d obtained 
from Dako (Ely, Cambs) at a dilution of 1 in 2000. The blot was developed using the ECL 
kit (Amersham, Middlesex). Those showing a positive signal on the dot blot were further 
scaled up and used to generate Passage 3 or 4 working stocks with virus titres between 
1 0^ and 1 0^ plaque forming units (pfu) per ml. 

(c) Expression of C3d oligomers in insect cells 

Sf9 cells adapted for growth in the absence of serum were grown in suspension 
culture to a cell density of 1.5x10^6 cells/ml in 4x50ml of media in 200ml Erlenmeyer 
flasks shaken at 20rpm. These cells were then infected by addition of 1x10^9 pfu/ml 
Passage 3 working stocks to give a final concentration of 5pfu/cell. The supernatant 72 
hours post infection was found to contain three species of C3d oligomer: (C3d)3, as 
encoded in the original vector, in trace amounts; (C3d)2, also in trace amounts; and C3d 
monomer in relatively large amounts (5-10mg/l). The harvest medium, 210ml, was stored 
at -40°C until further processed. 
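
The volume of working stock needed to reach the stated 5pfu/cell follows directly from the cell density, culture volume and virus titre given above. The short Python sketch below works through that arithmetic for one 50ml flask; the function name is illustrative, and the calculation assumes the titre of the working stock is exactly 1x10^9 pfu/ml.

```python
def virus_volume_ml(cells_per_ml, culture_ml, moi, titre_pfu_per_ml):
    """Volume of virus stock (ml) that delivers `moi` pfu per cell."""
    total_cells = cells_per_ml * culture_ml
    required_pfu = total_cells * moi
    return required_pfu / titre_pfu_per_ml


# Values from the text: 1.5x10^6 cells/ml, 50 ml per flask, 5 pfu/cell,
# working stock at 1x10^9 pfu/ml
per_flask = virus_volume_ml(1.5e6, 50.0, 5.0, 1e9)
print(f"{per_flask:.3f} ml of working stock per 50 ml flask")
print(f"{4 * per_flask:.2f} ml for all four flasks")
```

On these figures each flask requires 0.375ml of working stock, or 1.5ml for the four flasks together.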

(d) Purification of C3d from Sf9 Baculovirus Harvest Media 

Approximately half the harvest supernatant, 113ml, was adjusted to pH 7.4 with 
10M NaOH and the conductivity of the solution measured. Usually the salt concentration 
was below 0.1M; if it was above this concentration the solution was diluted with the 
appropriate amount of 50mM HEPES pH 7.4 to bring the salt concentration to 80mM. 

The harvest supernatant was loaded onto a Macroprep High Q column (bed volume 
25ml) pre-equilibrated in 50mM HEPES pH 7.4 and then washed with 50mM HEPES pH 
7.4 until the baseline stabilised. The linear flow rate for loading and washing the column 
was 89cm/h and was decreased to 60cm/h during elution. The bound proteins were 
eluted with a linear 0-1M NaCl gradient over 10 column volumes and 5ml fractions collected. 
The column material was heavily contaminated with medium components and was 
discarded after a single use. 

The C3d eluted between 0.15M and 0.3M NaCl as revealed by reducing 
SDS-PAGE. The fractions eluting between 0.15M NaCl and 0.2M NaCl were 50% pure. 
(C3d)2 and (C3d)3 eluted after the C3d1. The C3d1 pooled fractions, 40ml, were 
concentrated to 2ml using a 40ml stirred ultrafiltration cell containing a YM10 
membrane (Amicon, Glos.). No further purification was performed prior to 
characterisation of the product; see Sections (e) and (f) below. 
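
The relationship between NaCl concentration and position in the gradient can be worked through as below. This Python sketch uses the column parameters given above (25ml bed volume, linear 0-1M NaCl over 10 column volumes, 5ml fractions) and assumes an ideal linear gradient measured from the start of the gradient, ignoring column dead volume and any mixing delay; the fraction numbers are therefore nominal. The function names are illustrative.

```python
import math


def elution_volume_ml(conc_M, start_M=0.0, end_M=1.0,
                      column_volume_ml=25.0, gradient_cv=10.0):
    """Gradient volume (ml from gradient start) at which the nominal salt
    concentration reaches conc_M, ASSUMING an ideal linear gradient."""
    total_ml = column_volume_ml * gradient_cv
    return (conc_M - start_M) / (end_M - start_M) * total_ml


def fraction_number(volume_ml, fraction_ml=5.0):
    """1-based number of the fraction in which volume_ml is collected.
    Rounding guards against floating-point noise at fraction boundaries."""
    return max(1, math.ceil(round(volume_ml / fraction_ml, 6)))


for conc in (0.15, 0.2, 0.3):
    vol = elution_volume_ml(conc)
    print(f"{conc}M NaCl: {vol:.1f} ml, fraction {fraction_number(vol)}")
```

Under these assumptions 0.15M and 0.3M NaCl correspond to 37.5ml and 75ml into the 250ml gradient, that is nominally fractions 8 to 15, so a 40ml pool is consistent with eight 5ml fractions.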

(e) Physicochemical Characterisation of the monomeric species 
(i) N-Terminal Sequence 

A sample containing the monomeric C3d was separated on a reducing 
SDS-PAGE gel. The proteins were electro-transferred to a ProBlott membrane (Applied Biosystems) as 
described in General Methods (Section (xvi) (b)). After electro-transfer the membrane 
was rinsed in 100% methanol for 10s and then stained with a solution containing 0.1% 
(w/v) amido black (Sigma), 40% methanol, 1% acetic acid for 30s and subsequently 
destained with a 50% methanol solution. The band corresponding to the C3d monomer 
was excised. The N-terminal sequence of the protein was analysed by Edman 
degradation using a Beckman Automatic Amino Acid Analyser (Beckman). A single 
N-terminal sequence was obtained; this sequence corresponded to residues 25 to 44 of the 
C3d1 sequence expressed in Baculovirus/Sf9 cells (SEQ ID No. 37) and indicated that 
the secreted monomer had its signal sequence removed. 



(ii) Mass Spectrometry. 

The mass of the C3d was determined by electrospray mass spectrometry on a solution 
containing the purified C3d. A single peak was found with a mass of 34279 Da. This 
corresponds to a sequence starting at the N-terminal sequence as described in Section (e) 
(i) above but containing the C-terminal antibody affinity tag (Glu-Glu-Phe). This 
sequence is shown below (SEQ ID No. 37). 



(iii) Western Blotting 

The purified C3d was western blotted according to the methods described in Section (xvi) 
(b) following separation on a reducing SDS-PAGE. The blot was probed with primary 
antibodies to C3d and the affinity tag Glu-Glu-Phe. 

(1) C3d Antibody. Primary rabbit polyclonal antibody to human C3d (Dako, Cambs., 
UK) and the secondary antibody, a biotin-labelled polyclonal goat anti-rabbit IgG 
antibody (Amersham, Bucks.), were used at 1:500 dilution of the stock. The blot was 
developed using the Immunogold Silver Staining Kit (Amersham) according to the 
manufacturer's instructions. The developed blot showed a single immunoreactive band 
corresponding to the correct position of the C3d monomer on the SDS-PAGE. 
(2) Antibody to Affinity Tag. Monoclonal rat antibody from clone YL1/2 (Serotec) 
recognising the C-terminal tag Glu-Glu-Phe was used at a 1:500 dilution to probe the 
immunoblot. The secondary antibody, biotin-labelled polyclonal goat anti-rat antibody, 
was used at a dilution of 1:1000. The blot was developed using the Immunogold Silver 
Staining Kit (Amersham, UK) according to the manufacturer's instructions. The 
developed blot showed a single immunoreactive band corresponding to the correct 
position of the C3d monomer on the SDS-PAGE. This result indicated the presence of 
the epitope tag in a major proportion of the C3d monomers. 

The results in Sections (i) to (iii) suggest that the C3d expressed in the harvest 
supernatant contains the N-terminal of the first C3d unit, which carries the signal 
sequence encoded in the plasmid pBP68-01, and that the C-terminal of the molecule is 
encoded by the third unit of C3d, which carries the affinity tag in the plasmid. The 
amino acid sequence of this combination is described in SEQ ID No. 37. 

(f) Biological Assay of the C3d monomer 

The biological function of C3d monomer produced in baculovirus was tested by 
its ability to bind to its receptor, complement receptor-2 (CR2). 

(i) Competitive Binding Assay of C3d to Raji Cells 

The binding was performed as described in General Methods Section (xvii) (a). 
The amount of radioactivity associated with the cell pellet was used to construct a 
binding curve of 125I-HEL-C3d bound to the Raji cells. This showed that a 50% reduction 
in binding of the label was achieved at a concentration of 2.5µM of either the 
baculovirus/Sf9 C3d or C3d expressed in COS cells. 



(ii) ELISA demonstrating C3d Binding to CR2 

The ELISA was performed as described in General Methods Section (xvii) (b). 
The monomer gave an I50 of 2µM for the baculovirus/Sf9 material. A similar figure was 
obtained for the COS cell produced C3d. 

(iii) Anti Haemolytic Activity of C3d 

The anti-haemolytic assay was performed as described in the General Methods 
Section (xvii) (c) on a purified sample of C3dl. The C3dl protein product inhibited 
complement-mediated lysis of sensitised sheep red blood cells with an IH50 of 
approximately 700nM. The C3d activity in this assay is probably as a competitive 
inhibitor of C3 in the complement pathway. 

The data described in this Section provide evidence that C3d produced in the 
baculovirus/Sf9 expression system is equivalent in activity to that produced in COS cells. 
However, this system produces at least 50-fold more C3d per litre than the COS system, 
offering significant fermentation savings. 

(g) Characterisation of baculovirus vector produced from pBP68-01 

Characterisation of the monomeric protein led to the hypothesis that homologous 
recombination had occurred within the repeated C3d coding sequence in the baculovirus 
vector, leading to deletion of two copies of C3d. PCR analysis was used to characterise 
the viruses present in the cultures that produced the monomeric species. 

DNA was extracted from the cell pellets from 50ml samples from two 
independent production runs. The cells were suspended in 5ml of a solution containing 
10mM Tris, 10mM EDTA, 100µg/ml proteinase K pH 7.6. After re-suspension SDS 
was added to 0.5% (w/v), mixed gently to avoid shearing the DNA and incubated for 60 
minutes at 31°C. An equal volume of phenol:chloroform:isoamyl alcohol, 1:24:1, 
(Sigma) was added and mixed by inversion for 5 minutes then centrifuged at 12000g for 
5 minutes. The upper aqueous layer was transferred to a clean tube and a 0.6 volume of 
isopropanol was added to precipitate the DNA. A clot of DNA was produced which was 
transferred into 5ml 80% (v/v) ethanol for 1 minute. The ethanol was poured off and 
drained without allowing the DNA clot to dry completely. The DNA clot was 
dissolved in 1ml deionised water containing 50 units of RNase A (Sigma). 

A PCR reaction was carried out to determine the length of the DNA encoding 
C3d. The primers used for the PCR were homologous to sequences flanking the coding 
region, corresponding to the position of the EcoRI and XbaI restriction sites used in the 
cloning. The sequence of the forward primer #39171 was GAATTCCTAGCTTGCTTG 
(SEQ ID No. 30) and the reverse primer #39172 was TCTAGAGTCGACCAGAC (SEQ 
ID No. 31). The PCR conditions were as follows: 1µl DNA solution (from cell pellets), 
1µl 10mM dNTPs, 50pmol each of forward and reverse primers (SEQ ID Nos. 30 and 31), 
5µl x10 Taq polymerase buffer (Promega), 3µl 25mM MgCl2, deionised water to 49µl 
and 1µl Taq polymerase (5 units) (Promega). PCR was carried out in a thermal cycler 
with the following cycle: Step 1: 95°C for 1 minute; Step 2: 95°C for 30 seconds; Step 3: 
55°C for 30 seconds; Step 4: 72°C for 1 minute; Step 5: 72°C for 5 minutes. Steps 2-4 
were repeated 30 times prior to Step 5. A tenth of the products were run on 1% (w/v) 
agarose gels. The expected size for (C3d)3 was 3088 base pairs and for C3d monomer 
was 1217 base pairs. The PCR product of the reactions was clearly a 1217 base pair 
band with no evidence of a 3088 base pair band. 

These data, and the protein characterisation data strongly suggest that a 
homologous recombination event has occurred resulting in the deletion of two of the 
three C3d genes, leaving the signal peptide and the amino terminal portion of the first 
domain and the carboxy terminal portion and the Glu-Glu-Phe tag of the third domain. 
The point of crossover is impossible to determine as the coding sequences in the three 
copies of C3d are identical. 
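
The reason the crossover point cannot be located may be illustrated with a small Python sketch. It is a toy model only: the sequences `signal`, `repeat` and `tag` are short hypothetical stand-ins, not the real C3d construct, and `crossover` is an illustrative helper that models a single exchange between the first and third copies.

```python
def crossover(construct, start_a, start_b, offset):
    """Join the left arm, up to `offset` bases into repeat A, to the right arm
    starting `offset` bases into repeat B. Everything between the two exchange
    points is deleted."""
    return construct[:start_a + offset] + construct[start_b + offset:]


# Hypothetical toy sequences, far shorter than the real construct.
signal = "ATGAAA"               # stands in for the signal peptide region
repeat = "GCTGGTTCAGGAGAACAG"   # one "domain"; three identical copies are used
tag = "GAAGAATTTTAA"            # stands in for the tag and stop codon

construct = signal + repeat * 3 + tag
first = len(signal)                     # start of copy 1
third = len(signal) + 2 * len(repeat)   # start of copy 3

# Try every possible exchange point within the repeat.
products = {crossover(construct, first, third, x) for x in range(len(repeat) + 1)}

for p in products:
    print(len(p), p)
```

Because the copies are identical, every exchange point gives the same single-copy product, so the product sequence carries no record of where the exchange took place.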

Subsequent attempts to isolate a stable virus containing three C3d domains by 
plaque cloning were unsuccessful. All viruses recovered contained only the single C3d 
domain determined by PCR analysis as described above. 

Example 3. Design and Construction of a Fuzzy Gene encoding C3d monomer 

(a) Synthetic gene design 

Homologous recombination occurs as a result of DNA strand exchange between 
regions of homology. In the present invention a series of synthetic C3d domains are 
constructed in such a way as to reduce to a minimum the presence of homologous DNA 
when the sequences are concatenated, without altering the coding sequence of the C3d 
domain. The basis of the invention is to utilise the third base "wobble" phenomenon to 
introduce silent changes throughout the domain. The number of possible variants of this 
is very large, and they cannot be specified individually. Where third base "wobble" is 
introduced the variations shown in Table 1 are included. Certain third base "wobble" 
options are specifically avoided where a codon was found to be rare in mammalian, 
insect or bacterial genomes. Codons with variable third base positions are described 
hereinafter as "fuzzy" codons. 

Table 1 

Amino acid   Codon        Rare codons                                Fuzzy codon   Reverse fuzzy codon 
Ala          GCN          none                                       GCN           NGC 
Arg          CGN or AGR   CGG (2,3), CGA (1; other marks illegible)  CGY           RCG 
Asn          AAY          none                                       AAY           RTT 
Asp          GAY          none                                       GAY           RTC 
Cys          TGY          none                                       TGY           RCA 
Gln          CAR          none                                       CAR           YTG 
Glu          GAR          none                                       GAR           YTC 
Gly          GGN          none                                       GGN           NCC 
His          CAY          CAT (2)                                    CAC           GTG 
Ile          ATH          ATA (3)                                    ATY           RAT 
Leu          CTN or TTR   CTA (1,2,3)                                CTB           VAG 
Lys          AAR          none                                       AAR           YTT 
Phe          TTY          none                                       TTY           RAA 
Pro          CCN          CCC (3), CCG (1; other mark illegible)     CCW           WGG 
Ser          TCN or AGY   TCG (1)                                    TCH           DGA 
Thr          ACN          none                                       ACN           NGT 
Tyr          TAY          none                                       TAY           RTA 
Val          GTN          none                                       GTN           NAC 

NB: The codons for Met (ATG) and Trp (TGG) are invariant. 

Key: 

1    Rare codon in mammalian genome 
2    Rare codon in insect genome 
3    Rare codon in bacterial genome 
R    A or G 
Y    C or T 
S    C or G 
W    A or T 
H    A or C or T 
B    C or G or T 
V    A or C or G 
D    A or G or T 
N    A or C or G or T 
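
The fuzzy codon notation of Table 1 can be checked mechanically. The following Python sketch expands each fuzzy codon into the concrete codons it covers and confirms that all of them encode the same amino acid under the standard genetic code. The function names are illustrative only, and `FUZ1_CODING` is the C3d-coding part of Fuz1 as printed in Seq ID No 1 (bases 22 to 99, that is, 26 complete codons).

```python
from itertools import product
from math import prod

# IUPAC nucleotide codes used in Table 1 and its key.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "H": "ACT", "B": "CGT", "V": "ACG", "D": "AGT", "N": "ACGT",
}

# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def expand(fuzzy_codon):
    """Return every concrete codon covered by one fuzzy codon."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in fuzzy_codon))]


def encoded_residues(fuzzy_codon):
    """Return the set of amino acids (one-letter codes) a fuzzy codon can encode."""
    return {CODON_TABLE[c] for c in expand(fuzzy_codon)}


def translate_fuzzy(seq):
    """Translate a fuzzy sequence, failing if any codon is not silent."""
    peptide = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        residues = encoded_residues(codon)
        if len(residues) != 1:
            raise ValueError(f"codon {codon} is ambiguous: {sorted(residues)}")
        peptide.append(next(iter(residues)))
    return "".join(peptide)


# C3d-coding region of Fuz1, copied from Seq ID No 1 of this document.
FUZ1_CODING = (
    "ACNCCWGCNGGNTCHGGNGARCARAAYATG"
    "ATYGGNATGACNCCWACNGTNATYGCNGTN"
    "CACTACCTGGACCAGACC"
)

peptide = translate_fuzzy(FUZ1_CODING)
# Number of distinct DNA sequences that encode this same peptide segment.
n_variants = prod(
    len(expand(FUZ1_CODING[i:i + 3])) for i in range(0, len(FUZ1_CODING), 3)
)
print(peptide, n_variants)
```

A codon such as ATN would be rejected by `translate_fuzzy`, since it covers both isoleucine and methionine codons; this is why Table 1 restricts isoleucine to ATH (and, avoiding the rare ATA, to ATY).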



Twelve oligonucleotides ranging from 76 to 106 bases in length were synthesised 
encoding the upper or lower strand of the C3d sequence. Oligonucleotide Fuz1 (SEQ 
ID No. 1) encoded the first 79 bases of the C3d sequence (forward strand) preceded by 21 
bases including a BglII site (in frame with the BglII site in pBP68-01) and an NdeI site (in 
frame with the NdeI site in pDB1032 and pDB1033). Of the 26 C3d amino acid codons 
present in Fuz1, the first 20 are encoded by fuzzy codons (with the exception of the 
invariant methionine codon ATG). The carboxy terminal 14 residues of Fuz1 are 
invariant, with codons selected where possible to be GC rich to enhance annealing of 
these complementary sequences. This region represents the region of overlap with Fuz2 
(SEQ ID No. 2). Oligonucleotide Fuz2 encodes amino acids 23 to 54 of C3d (reverse 
strand), where the reverse codons for amino acids 23-26 at the 3' end are invariant and 
overlap Fuz1. Amino acids 27 to 49 are encoded by reverse fuzzy codons (with the 
exception of the invariant tryptophan reverse codon CCA). At the 5' end the terminal 14 
residues of Fuz2 are invariant, with reverse codons selected where possible to be GC rich. 
This region represents the region of overlap with Fuz3. 

Fuz3 through to Fuz11 (SEQ ID Nos. 3 to 11) were designed along similar 
lines, with a central region of fuzzy codons and invariant, GC-rich ends overlapping by 
13-17 bases with the flanking oligonucleotides. 
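
The overlap design can be checked directly on the printed sequences. The Python sketch below takes Fuz1 and Fuz2 exactly as printed in Seq ID Nos 1 and 2 of this document and finds the longest stretch at the 3' end of Fuz1 that is the reverse complement of the 3' end of Fuz2. The helper names are illustrative only. Plain string comparison is sufficient here because the overlap is expected to contain invariant bases only.

```python
# Complement table covering the IUPAC codes used in the fuzzy oligonucleotides.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")


def reverse_complement(seq):
    """Reverse complement of a sequence that may contain IUPAC codes."""
    return seq.translate(COMPLEMENT)[::-1]


def overlap_length(upper, lower):
    """Longest k for which the last k bases of `upper` equal the first k bases
    of the reverse complement of `lower` (both written 5' to 3')."""
    target = reverse_complement(lower)
    for k in range(min(len(upper), len(target)), 0, -1):
        if upper[-k:] == target[:k]:
            return k
    return 0


def gc_fraction(seq):
    """Fraction of G and C bases in an unambiguous sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


# Sequences copied from Seq ID No 1 and Seq ID No 2 of this document.
FUZ1 = (
    "CACCCGAGCCATATGAGATCTACNCCWGCNGGNTCHGGNGARCARAAYATG"
    "ATYGGNATGACNCCWACNGTNATYGCNGTNCACTACCTGGACCAGACCG"
)
FUZ2 = (
    "GGCCAGCTGCTGGGTRTANCCYTTYTTRATVAGYTCVAGNGCYTCYTGRCGY"
    "TTYTCRATNCCRAAYTTYTCCCAYTGYTCGGTCTGGTCCAGG"
)

k = overlap_length(FUZ1, FUZ2)
overlap = FUZ1[-k:]
print(k, overlap, round(gc_fraction(overlap), 2))
```

The same function could be applied to each adjacent pair in the series to confirm that every annealing region is free of fuzzy positions before the oligonucleotides are ordered.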

Fuz12 (SEQ ID No. 12) overlaps with Fuz11 at the 3' end by 14 bases and has the 
central region of fuzzy codons encoding C3d, but the 5' end of the oligonucleotide, which 
represents the carboxy terminus of the gene, contains a linker region (Gly-Gly-Gly-Gly-Ser, 
in fuzzy codons), a BamHI site, a stop codon and an EagI site. The BamHI site 
allows for subsequent fusion to the amino-terminal BglII site of another C3d domain for 
concatenation of multiple domains. The stop codon will be retained only in the 
carboxy-terminal C3d domain, and the EagI site allows for subsequent cloning into the 
baculovirus vector pBacPak8. 




(b) Gene assembly 

The fuzzy C3d gene was built in stages involving cycles of annealing and 
amplification using the polymerase chain reaction (PCR) methodology; these amplified 
segments are then ligated together. 

In the first stage the fuzzy oligonucleotides were annealed in pairs through their 
GC-rich invariant overlapping regions in the following combinations: Fuz1 + Fuz2, Fuz3 
+ Fuz4, Fuz5 + Fuz6, Fuz7 + Fuz8, Fuz9 + Fuz10, Fuz11 + Fuz12. To amplify the 
paired fuzzy oligonucleotides, PCR primers were designed for each strand, directed to the 
overlapping invariant GC-rich region at the 5' end of the odd-numbered oligonucleotides 
and to the 3' end of the even-numbered oligonucleotides. These PCR primers incorporated 
base changes that created novel restriction sites at each end of the overlapping pairs of 
fuzzy oligonucleotides. This property allowed trimming of the amplified DNA sequences by 
specific endonucleases to give cohesive ends that anneal the amplified fuzzy 
oligonucleotides in their correct orientation. An alternative PCR primer to Fuz12, termed 
Fuz25 (SEQ ID No. 25), was identical to Fuz12 except that at its 5' end an additional codon 
was added prior to the stop codon to incorporate a C-terminal cysteine residue. This 
C-terminal cysteine in the expressed protein could be used subsequently to couple antigen 
to the C3d. 






The PCR primers are listed in Table 2 below, showing the fuzzy 
oligonucleotide pairs they are designed to amplify and their unique restriction sites: 

Table 2 PCR primers designed to amplify fuzzy oligonucleotide overlapping pairs 

Fuzzy oligonucleotide   PCR primer               Strand     Novel 
overlapping pairs                                           restriction site 

Fuz1 + Fuz2             Fuz23 (SEQ ID No. 23)    forward    None 
                        Fuz20 (SEQ ID No. 20)    reverse    PvuII 
Fuz3 + Fuz4             Fuz19 (SEQ ID No. 19)    forward    PvuII 
                        Fuz14 (SEQ ID No. 14)    reverse    XhoI 
Fuz5 + Fuz6             Fuz13 (SEQ ID No. 13)    forward    XhoI 
                        Fuz16 (SEQ ID No. 16)    reverse    XmaI 
Fuz7 + Fuz8             Fuz15 (SEQ ID No. 15)    forward    XmaI 
                        Fuz18 (SEQ ID No. 18)    reverse    BclI 
Fuz9 + Fuz10            Fuz17 (SEQ ID No. 17)    forward    BclI 
                        Fuz22 (SEQ ID No. 22)    reverse    HindIII 
Fuz11 + Fuz12           Fuz21 (SEQ ID No. 21)    forward    HindIII 
                        Fuz24 (SEQ ID No. 24)    reverse    None 
                        Fuz25 (SEQ ID No. 25)    reverse    None 
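
The way the primer-introduced sites fix the order of assembly can be expressed as a simple consistency check. The Python sketch below encodes the primary primer rows of Table 2, with enzyme names written in their standard form, and verifies that the site at the downstream end of each amplified pair matches the site at the upstream end of the next. The data structure and function name are illustrative only.

```python
# (oligonucleotide pair, forward primer, forward site, reverse primer, reverse site)
# Fuz25 is omitted because it is an alternative to Fuz24 at the same position.
PRIMERS = [
    ("Fuz1+Fuz2", "Fuz23", None, "Fuz20", "PvuII"),
    ("Fuz3+Fuz4", "Fuz19", "PvuII", "Fuz14", "XhoI"),
    ("Fuz5+Fuz6", "Fuz13", "XhoI", "Fuz16", "XmaI"),
    ("Fuz7+Fuz8", "Fuz15", "XmaI", "Fuz18", "BclI"),
    ("Fuz9+Fuz10", "Fuz17", "BclI", "Fuz22", "HindIII"),
    ("Fuz11+Fuz12", "Fuz21", "HindIII", "Fuz24", None),
]


def junction_sites(rows):
    """Return the restriction site shared at each junction between adjacent
    fragments, raising an error if any junction is mismatched."""
    sites = []
    for left, right in zip(rows, rows[1:]):
        downstream, upstream = left[4], right[2]
        if downstream is None or downstream != upstream:
            raise ValueError(f"{left[0]} and {right[0]} do not share a site")
        sites.append(downstream)
    return sites


junctions = junction_sites(PRIMERS)
print(junctions)
```

Since each internal junction uses a different enzyme, a fragment can only be joined to its intended neighbours, which is what enforces the correct order and orientation.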



The PCR reactions were all carried out under identical conditions, each 
reaction containing a single pair of overlapping fuzzy oligonucleotides. The PCR 
reaction was carried out in a total reaction volume of 0.6ml containing fuzzy 
oligonucleotides at a final concentration of 1nM and the PCR primers at 2µM in 
the combinations described in Table 2 above. To this was added 48µl 2.5mM dNTPs, 
60µl x10 Taq polymerase buffer (Promega), 48µl 25mM MgCl2 and 6µl Taq polymerase 
(30 units) (Promega); sterile deionised water was added to give a final volume of 0.6ml. 
The temperature cycling was identical for all reactions: 94°C for 5 minutes followed by 35 
cycles of 50°C/2 minutes; 72°C/3 minutes; 94°C/1 minute; this was followed by an 
incubation at 72°C for 7 minutes. 
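
The final concentrations implied by these volumes follow from simple dilution arithmetic. The Python sketch below assumes the volumes read as 48µl, 60µl, 48µl and 6µl in a 600µl reaction; the function name is illustrative only.

```python
def final_concentration(stock, volume_ul, total_ul):
    """Concentration after diluting `volume_ul` of stock into `total_ul`,
    in the same units as `stock`."""
    return stock * volume_ul / total_ul


TOTAL_UL = 600.0  # 0.6 ml reaction

dntp_mM = final_concentration(2.5, 48, TOTAL_UL)    # 2.5 mM dNTP stock
mgcl2_mM = final_concentration(25.0, 48, TOTAL_UL)  # 25 mM MgCl2 stock
buffer_x = final_concentration(10.0, 60, TOTAL_UL)  # x10 buffer
taq_units_per_ml = 30 / (TOTAL_UL / 1000.0)         # 30 units in 0.6 ml

print(dntp_mM, mgcl2_mM, buffer_x, taq_units_per_ml)
```

On these assumptions the reaction contains 0.2mM dNTPs, 2mM MgCl2 and x1 buffer, and the smaller re-amplification reactions described later in this Example can be checked in the same way.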

The PCR reaction products were separated on a 1.3% (w/v) agarose gel and the 
amplified product at the correct size was excised from the gel and purified using either 
MERmaid Kit or MERmaid Spin Kit into 60µl of sterile deionised water. 

The purified products from the PCR reaction were digested with the appropriate 
restriction endonuclease (see Table 2 above) to provide cohesive ends for 
ligation. After restriction enzyme digestion the DNA was buffer exchanged into 30µl 
sterile deionised water by purifying the DNA from solution using either MERmaid Kit or 
MERmaid Spin Kit. 

These fragments were ligated either pairwise or in trimers through their 
cohesive ends under standard conditions, e.g. sections Fuz 1-2+3-4, 5-6+7-8, 9-10+11-12 
or Fuz 1-2+3-4+5-6 or Fuz 7-8+9-10+11-12, as described in General Methods Section. 
After ligation the ligated fragments were buffer exchanged using a Gene-Spin kit. 

These ligated fragments were then digested with the appropriate restriction 
endonucleases for ligation into the vector (see Example 4) and then buffer exchanged 
from solution using a Gene-Spin kit. Alternatively the ligated fragments were further 
amplified under the same PCR conditions using a reaction mix containing 2µM of the 
appropriate PCR primers, 15µl of ligated DNA fragments, 4µl 2.5mM dNTPs, 5µl x10 
Taq polymerase buffer (Promega), 4µl 25mM MgCl2, deionised water to 48µl and 2µl 
Taq polymerase (10 units) (Promega). The amplified product was excised from the gel 
and purified. The purified fragments were then digested with the appropriate restriction 
endonuclease to allow ligation into the vector as detailed in Example 4 below. 

A fuzzy C3d gene is directly assembled into pBP66-01 (see below for 
construction) or pBacPak8 vector following endonuclease digestion of pBP66-01 or 
pBacPak8 with BglII and EagI and purification of the vector band after separation on a 
0.7% agarose gel. The PCR-amplified fuzzy oligonucleotide pair products are 
digested to provide the cohesive ends between adjacent fuzzy fragments. 
Additionally the fragments containing the 5' and 3' ends of the fuzzy gene are digested 
with BglII and EagI respectively. The six restriction enzyme digested PCR products are 
purified by agarose gel electrophoresis, then ligated into the BglII/EagI 
digested pBP66-01 or pBacPak8 fragments under standard conditions. The ligated 
vectors are then transformed into E. coli XL-1 blue competent cells. These vectors are 
termed pBP66-10 to 19 (pBP66-01-derived) or pBP66-20 to 29 (pBacPak8-derived). 



Example 4. Construction and Expression of Fuzzy Genes encoding (C3d)3 using 
variants of the C3d sequence 

(a) Construction of the pBP68-30 series: baculovirus vectors for (C3d)3 which are 
resistant to homologous recombination 

pBP68-30 through to pBP68-39 contain three different variants of the C3d 
sequence that have been demonstrated by repeated passage in insect cells to be resistant 
to homologous recombination, see General Methods Section (xi), by virtue of sequence 
variation introduced at the third base "wobble" position in the first and second C3d 
domains. The pBP68-30 series of plasmids were constructed in two steps. In the first 
step, pBP68-01 was digested with the restriction enzyme SacI, which cleaves once within 
each C3d domain. The 6725 base pair vector fragment was purified and self-ligated to 
form pBP66-01. pBP66-01 contains a single C3d domain containing a signal peptide at 
the amino terminus and the Glu-Glu-Phe tag at the carboxy terminus. pBP66-01 was 
transformed into XL-1 blue E. coli and DNA extracted from the resulting colonies was 
analysed by restriction digest. 

In the second step pBP66-01 is digested with the restriction enzyme BglII. Fuzzy 
C3d domains or fragments were assembled as described in Example 3(b). PCR 
fragments or full-length products or plasmid vectors into which such products have been 
subcloned were digested with BamHI and BglII, and DNA fragments of 936 base pairs 
encoding single fuzzy C3d domains were purified and ligated with this fragment. 
Smaller fragments of fuzzy C3d may also be generated in Example 3(b) and these are 
digested with appropriate restriction enzymes to generate compatible cohesive ends to 
reconstruct a full length fuzzy C3d domain. 

The sequence of individual variants of the fuzzy C3d domains may be determined 
prior to this ligation following subcloning into a holding vector such as pBC00-02 (see 
below), or the ligation may be carried out using uncharacterised and potentially 
heterogeneous mixtures of PCR products generated in Example 3(b). 

The resulting plasmids are designated pBC66-10 to 19 where a single fuzzy C3d 
domain is cloned into pBC00-02 (for sequencing or bacterial expression), pBP67-20 to 
pBP67-29 where the baculovirus expression vector contains one fuzzy C3d domain and 
one invariant C3d domain, or pBP68-30 to pBP68-39 where the baculovirus expression 
vector contains two different fuzzy C3d domains and one invariant C3d domain. 

As an alternative strategy, the plasmid pBP66-01 is subjected to site directed 
mutagenesis to introduce unique restriction enzyme sites corresponding to the restriction 
enzyme sites engineered into the termini of the PCR products representing sub-domains 
of fuzzy C3d described in Table 2, such that the PCR primer pair Fuz13/Fuz14 (SEQ 
ID Nos. 13/14) are used to generate a novel XhoI site at position 1713 in pBC66-01, 
numbering according to SEQ ID No. 32; the PCR primer pair Fuz15/Fuz16 (SEQ ID 
Nos. 15/16) are used to generate a novel XmaI site at position 1866 in pBC66-01; the PCR 
primer pair Fuz17/Fuz18 (SEQ ID Nos. 17/18) are used to generate a novel BclI site at 
position 2060 in pBC66-01; the PCR primer pair Fuz21/Fuz22 (SEQ ID Nos. 21/22) are 
used to generate a novel HindIII site at position 2222 in pBC66-01. These restriction 
sites and a unique PvuII site present in the invariant sequence at position 1553 are utilised 
to excise specific fragments of the invariant C3d sequence. These are replaced with 
corresponding fuzzy C3d fragments generated in Example 3(b) having compatible 
cohesive ends, identical amino acid coding sequence, but fuzzy DNA sequence at the 
third base "wobble" position as described in Example 3(a). 

The resulting plasmids where the restriction sites are introduced into invariant 
C3d sequence are designated pBP66-02 (XhoI site), pBP66-03 (XmaI site), pBP66-04 
(BclI site), pBP66-05 (HindIII site). 

(b) Expression of (C3d)3 using fuzzy gene variants of the C3d sequence in 
Baculovirus/Sf9 system 

Recombinant baculoviruses are generated from the plasmids constructed as 
described in Section (a), this example, and used to express the encoded polypeptide 
according to the methods described in General Methodology Sections (viii) to (x). 

(c) Purification and Characterisation of the (C3d)3 Expressed from the fuzzy 
variants 

The (C3d)3 protein is purified as described in Section (xiv) of General Methods. 
The purified protein is characterised biologically by the methods described in Section 
(xvii) and physically characterised as described in Sections (xv) and (xvi) (b-c) and its 
exact mass determined by mass spectrometry. N-terminal sequence is also determined. 

(d) Construction of pBC66-10, pBC67-10 and pBC68-10: E. coli vectors for 
(C3d)3 which are resistant to homologous recombination 

pBroc413 is digested with the restriction endonucleases NdeI and PstI and the 
vector fragment isolated from a 0.9% (w/v) agarose gel. To create a new multi-cloning 
site suitable for exchange of C3d variants between the pBacPak8 vector and a bacterial 
expression vector, a new polylinker described in SEQ ID Nos. 38 and 39 is ligated into 
the vector fragment. This new vector is called pBC00-02. 

The plasmids designated pBP66-10 to 19 encode one fuzzy C3d domain. The 
series pBP67-20 to 29 encode one fuzzy C3d variant and the native C3d sequence, and 
pBP68-30 to 39 contain two fuzzy C3d variants and a native C3d sequence. These 
plasmids are digested with the restriction endonucleases BglII and EagI and the C3d 
encoding fragments purified from a 1% (w/v) agarose gel. pBC00-02 is similarly cut 
with BglII and EagI and the larger fragment (vector fragment) purified from a 0.9% (w/v) 
agarose gel. 

The vector fragments and the C3d encoding fragments are ligated and 
transformed into E. coli XL-1 blue under standard conditions and the clones isolated and 
screened by restriction mapping. The new plasmids are designated: pBC67-10 to 19 
encoding one fuzzy C3d domain, pBC67-20 to 29 encoding one fuzzy C3d variant 
and the native C3d sequence, and pBC68-30 to 39 containing two fuzzy C3d variants and a 
native C3d sequence. 

Example 5. Construction and expression of a cysteine-tailed C3d monomer or 
(C3d)3 

(a) Construction of pBP66-06 and pBP68-40 to pBP68-49 

The C-terminal cysteine is introduced by site directed mutagenesis of the plasmid 
pBP66-01 to form pBP66-06, and the pBP68-40 to pBP68-49 series of plasmids is 
constructed from the series pBP68-30 to pBP68-39. These plasmids, or a selected 
plasmid with a chosen variant of fuzzy C3d, are subjected to site directed mutagenesis 
with oligonucleotides with the following sequences: 

CAGTGGATCTTGCTAGAGTTCTGCAGGATC (Seq ID No 33) and 
GATCCTGCAGAACTCTAGCAAGATCCACTG (Seq ID No 34). Site directed mutagenesis is 
carried out using a "QuickChange" kit obtained from Stratagene (Cambridge UK) according 
to the manufacturer's protocols. The resulting plasmids, pBP66-06 and pBP68-40 to pBP68-49, 
are transformed into competent E. coli XL1-blue purchased from Stratagene. Ampicillin 
resistant transformants are selected on LB-agar plates containing 50µg/ml ampicillin. 
Plasmid DNA extracted from the resulting colonies is identified initially by restriction 
enzyme digest and confirmed by DNA sequencing. 
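
The two mutagenic oligonucleotides are expected to anneal to opposite strands of the same site, so each should be the reverse complement of the other. The Python sketch below checks this for the sequences printed above as Seq ID Nos 33 and 34. It also reports whether the triplet TGC (a cysteine codon) is directly followed by TAG (a stop codon) and whether a PstI recognition sequence (CTGCAG) is present. The reading frame is not verified here, so the codon check is consistent with, but does not prove, a cysteine placed before the stop codon.

```python
def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# Oligonucleotides copied from this Example.
SEQ_ID_33 = "CAGTGGATCTTGCTAGAGTTCTGCAGGATC"
SEQ_ID_34 = "GATCCTGCAGAACTCTAGCAAGATCCACTG"

are_complementary = reverse_complement(SEQ_ID_33) == SEQ_ID_34
has_cys_then_stop = "TGCTAG" in SEQ_ID_33   # TGC followed directly by TAG
has_pst1_site = "CTGCAG" in SEQ_ID_33       # PstI recognition sequence

print(are_complementary, has_cys_then_stop, has_pst1_site)
```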

(b) Expression of cysteine-tailed C3d and (C3d)3 sequence in Baculovirus/Sf9 
system 

Recombinant baculoviruses are generated from the plasmids constructed as described in 
Section (a), this example, and used to express the encoded polypeptide according to the 
methods described in "General Methodology used in examples" Sections (viii) to (x). 



(c) Purification and Characterisation of the cysteine-tailed C3d and (C3d)3 
Expressed from the fuzzy variants in Sf9/baculovirus system 

The (C3d)n-cys tailed protein is purified as described in Section (xiv) of General 
Methods, characterised biologically by the methods described in Section (xvii) and 
physically characterised as described in (xv) and (xvi) (b-c). 

(d) Conjugation of C3d-cys or (C3d)3-cys with antigen 

Purified C3d-cys is treated with Tris(2-carboxyethyl)phosphine.HCl (TCEP) to 
remove any blocking moiety on the C-terminal cysteine. Thus, C3dn (10µM; 1ml) is 
mixed with TCEP (5mM in 50mM HEPES pH 4.5; 0.008ml) and incubated at 20 to 
25°C for about 18h to give solution A. Antigen is prepared typically by chemical 
attachment of a suitable linking group to the antigen followed by conjugation to the free 
cysteine on the (C3d)3 molecule. Antigen (20µM in 0.1M Trien pH 8.0; 1.0ml) is mixed 
with 2-iminothiolane (10mM in 0.1M Trien pH 8.0; 0.01ml) and incubated at 20 to 25°C 
for 1h. DTNB (100mM in 0.1M Trien pH 8.0; 0.01ml) is added and the mixture left for 
a further 1h. The mixture is then buffer exchanged into 0.1M Trien pH 8.0 (2.0ml) using 
Sephadex G25 (PD10; Pharmacia) to give solution B. Solution B (1.0ml) is mixed with 
solution A (1.0ml) and incubated at 20 to 25°C for 18h, optionally with concentration by 
ultrafiltration, to give solution C. Solution C is optionally purified to remove unreacted 
materials, for example by size exclusion chromatography, and then formulated into final 
product. This formulation may involve buffer-exchange into a physiologically 
acceptable buffer, for example phosphate buffered saline, followed by sterile filtration, 
aliquoting and freezing, or it may involve buffer-exchange into a suitable buffer for 
lyophilisation. 

Example 6. Construction of a vector for expression of an unspecified gene product 
fused to (C3d)3. 

In this example any of the above plasmids used for expression of C3d monomer or 
oligomers is modified by the addition of sequences specifying a range of commonly 
utilised restriction enzyme sites which are not found elsewhere in the plasmid. This is 
known as the polylinker, and is intended to provide convenient sites for insertion of an 
unspecified DNA fragment encoding a polypeptide, which results in an in-frame fusion 
with the mature C3d protein or C3d oligomer, either at the amino terminus or the 
carboxy terminus. The sequence comprising the polylinker is provided in three formats 
where the restriction sites are shifted by one base to facilitate in-frame fusions of the 
unspecified DNA fragments. 



Example 7. Expression and purification of C3d oligomers as Intein fusion proteins 

(a) Construction of pBP66-07 and pBP68-50 to pBP68-59. 

In this example, a form of C3d or C3d oligomers is expressed with a carboxy 
terminal reactive thiolester allowing direct covalent coupling to the antigen of choice. It 
should be noted that the natural C3d sequence normally contains a thiolester that is buried 
in native C3 and is the site of attachment to antigen in C3b and its fragments. In the 
plasmid pSG.C3di.YL and its derivatives, the cysteine residue that is the biosynthetic 
precursor of the natural thiolester is mutated to prevent interference with disulphide bond 
formation in recombinant C3d proteins (Dempsey et al, Science 271, 348-350, 1996). In 
the present construct the thiolester is effectively moved to the C-terminus of the C3d 
domain. 

The carboxy terminal of C3d in the vector pBP66-01 or any of the pBP68-30 to 
pBP68-39 series is modified by site directed mutagenesis to introduce novel restriction 
sites SapI and PstI using oligonucleotides #50391 (Seq ID No 35) 
(CCAGCAGTGGCTCTTCCTGCTTCTGCAGGATC) and #50392 (Seq ID No 36) 
(GATCCTGCAGAAGCAGGAAGAGCCACTGCTGG). Site directed mutagenesis is 
carried out using a "QuickChange" kit obtained from Stratagene according to the 
manufacturer's protocols. The resulting plasmids, pBP66-07 and pBP68-50 to pBP68-59, 
are transformed into competent E. coli XL1-blue purchased from Stratagene. Ampicillin 
resistant transformants are selected on LB-agar plates containing 50µg/ml ampicillin. 
Plasmid DNA extracted from the resulting colonies is identified initially by restriction 
enzyme digest and confirmed by DNA sequencing. 

(b) Construction of pBP81-01 and pBP83-50 to pBP83-59 

pBP81-01 and pBP83-50 to pBP83-59 are plasmids in which the genes for intein 
and the chitin binding domain are cloned in frame downstream of the C3d-wild type or 
(C3d)3 in which one or more of the domains is fuzzy to prevent homologous 
recombination. The intein and chitin binding domain sequences are obtained from the 
commercially available plasmid pCYB1 (New England Biolabs) by digestion with the 
enzymes SapI and PstI and purification of the 1560 base pair fragment; this is termed 
fragment 1. pBP66-07 and pBP68-50 to pBP68-59 are digested with SapI and PstI and a 
fragment of 6718 base pairs is purified from pBP66-07 (fragment 2) and a fragment of 
8590 base pairs is purified from pBP68-50 to pBP68-59 (fragments 3a-3j). 

Fragments 1 and 2 are ligated using T4 DNA ligase to give pBP81-01. Fragments 
3a to 3j are each ligated with fragment 1 using T4 DNA ligase to give pBP83-50 
to pBP83-59. The ligated plasmids are transformed into competent E. coli XL1-blue 
purchased from Stratagene. Ampicillin resistant transformants are selected on LB-agar 
plates containing 50µg/ml ampicillin. Plasmid DNA extracted from the resulting 
colonies is analysed by restriction endonuclease digestion and confirmed by DNA 
sequencing. 

(c) Expression of C3d oligomers fused to intein and chitin binding protein 

Recombinant baculoviruses are generated from the plasmids constructed as described in 
Section (b) (this example) and used to express the encoded polypeptide according to the 
methods described in "General Methodology used in examples" Sections (viii) to (x). 

(d) Purification and cleavage of C3dn-intein fusion proteins to generate carboxy 
terminal reactive thiolester 

The C3dn-intein-chitin binding domain fusion protein is purified from material 
expressed into the media by the baculovirus-infected Sf9 cells. Prior to loading onto the 
chitin bead affinity column (New England Biolabs, Herts) the pH of the media is 
corrected to pH 8.0 by addition of 10M NaOH. The salt concentration of the media is 
measured by conductivity (usually about 0.08M) and solid NaCl is added to the media to 
give a final concentration of 0.5M NaCl. 
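
The amount of solid NaCl required can be estimated as follows. This Python sketch assumes that the conductivity reading is treated as an equivalent NaCl concentration and that the volume change on dissolving the salt is negligible; the function name and the 1 litre volume are illustrative only.

```python
NACL_G_PER_MOL = 58.44  # molar mass of sodium chloride


def nacl_to_add(volume_l, start_molar, target_molar):
    """Grams of solid NaCl needed to raise `volume_l` litres of medium from
    `start_molar` to `target_molar`, ignoring any change in volume."""
    if target_molar < start_molar:
        raise ValueError("target concentration is below the starting value")
    return (target_molar - start_molar) * volume_l * NACL_G_PER_MOL


# Typical case from the text: about 0.08M measured, 0.5M required.
grams_per_litre = nacl_to_add(1.0, 0.08, 0.5)
print(round(grams_per_litre, 1))
```

On these assumptions roughly 24.5g of NaCl would be added per litre of medium.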

The C3dn-intein-chitin binding domain fusion containing supernatant is loaded 
onto the chitin bead affinity matrix column and separation carried out according to the 
instructions in the 'IMPACT I' kit (New England Biolabs, Herts, UK). The cleavage on 
the column utilises a protein splicing element known as an intein, which undergoes a 
self cleavage reaction between the C-terminal of the C3dn and the N-terminal of the 
intein in the presence of a reducing agent, e.g. DTT. As a result of this reaction the 
eluted C3dn contains a reactive thiolester at its C-terminal. The resulting affinity 
purified protein should be at least 90% pure; if the purity is less than 90% prior to 
coupling to antigen it should be further purified using techniques described in General 
Methods Section (xiv). 

(e) Reaction with C3d(n)-thiolester with antigen. 

The activated thiolester produced by cleavage at the C-terminus of C3dn by the 
intein mechanism (Section (d) above) is used to couple antigen to the molecule. The 
thiolester reacts with free nucleophiles on the surface of the antigen, e.g. the ε-amino 
group of lysine or, under some circumstances, hydroxyl groups in the side chains of 
amino acids such as serine and tyrosine or in the sugar groups of glycosylated proteins. 
The antigen is preferably present in molar excess over the thiolester and is preferably 
added to the purified C3dn after elution from the column, during the processing that 
removes the DTT (which acts as a competing nucleophile) and co-concentrates the 
proteins. Formation of the antigen-C3d complex is monitored by mass spectrometry and/or gel 
electrophoresis to detect all the species in the population. Complexes containing at 
least one C3d and one antigen unit are separated by gel filtration or by passage through 
a size exclusion membrane that separates the multimers from the monomers. 
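
Whether the antigen is in molar excess can be checked from the masses and molecular weights of the two proteins. The sketch below is illustrative only: the function name is hypothetical and the molecular weights in the example are placeholder values, not figures from this specification.

```python
def molar_ratio(antigen_mg, antigen_kda, c3d_mg, c3d_kda):
    """Moles of antigen per mole of C3d thiolester in the coupling mixture."""
    antigen_umol = antigen_mg / antigen_kda  # mg divided by kDa gives micromoles
    c3d_umol = c3d_mg / c3d_kda
    return antigen_umol / c3d_umol


# Placeholder values: 2 mg of a 14 kDa antigen mixed with 1 mg of a 35 kDa C3d
ratio = molar_ratio(2.0, 14.0, 1.0, 35.0)
print(f"antigen : C3d molar ratio = {ratio:.1f}")
```

A ratio above 1 indicates that the antigen is in molar excess over the thiolester.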






Example 8. Construction of fuzzy genes encoding species variants of (C3d)3 

All the above examples employed the murine C3d sequence. Using the methodology 
described in Example 3, fuzzy C3d sequences are devised for any species, including but 
not restricted to human, cat, rabbit, bovine, ovine, equine and goat, to generate a series of 
variants containing either a single C3d domain, or two independent fuzzy variant 
domains of C3d and a single domain of invariant DNA sequence. Subsequent 
modifications described in all later Examples are carried out in an identical manner. 

Where the sequence is known, e.g. human (de Bruijn and Fey (1985) PNAS (USA) 
82:708-712), the fuzzy oligonucleotides are generated directly; in cases where the 
sequence is unknown, C3d hybridisation probes from either human or mouse are used 
to identify, and thereby clone and sequence, C3d from a library of the chosen species. 



Seq ID No 1: DNA sequence of Fuz1 

CACCCGAGCCATATGAGATCTACNCCWGCNGGNTCHGGNGARCARAAYATG 
ATYGGNATGACNCCWACNGTNATYGCNGTNCACTACCTGGACCAGACCG 
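
The fuzzy oligonucleotides use IUPAC degenerate base codes (N, Y, R, W, H and so on), so that each degenerate codon still encodes a single amino acid. The sketch below is an illustration only, with hypothetical helper names; it expands each degenerate codon with the standard genetic code and translates the in-frame coding part of Fuz1 (the 60 bases starting at ACN).

```python
from itertools import product

# IUPAC nucleotide codes mapped to the bases they stand for
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "W": "AT", "S": "CG", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def encoded_residues(fuzzy_codon):
    """Set of amino acids encoded by all expansions of a degenerate codon."""
    options = [IUPAC[base] for base in fuzzy_codon]
    return {CODON_TABLE["".join(codon)] for codon in product(*options)}


def translate_fuzzy(seq):
    """One-letter translation; '?' marks a codon that is ambiguous at the protein level."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aas = encoded_residues(seq[i:i + 3])
        residues.append(aas.pop() if len(aas) == 1 else "?")
    return "".join(residues)


# In-frame coding part of Fuz1 (Seq ID No 1)
FUZ1_FRAGMENT = (
    "ACNCCWGCNGGNTCHGGNGARCARAAYATG"
    "ATYGGNATGACNCCWACNGTNATYGCNGTN"
)
print(translate_fuzzy(FUZ1_FRAGMENT))
```

Every codon in this fragment resolves to one residue, and the translation corresponds to Thr Pro Ala Gly Ser Gly Glu Gln Asn Met Ile Gly Met Thr Pro Thr Val Ile Ala Val in the C3d amino acid sequence of SEQ ID No 26.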

Seq ID No 2: DNA sequence of Fuz2 

GGCCAGCTGCTGGGTRTANCCYTTYTTRATVAGYTCVAGNGCYTCYTGRCGY 
TTYTCRATNCCRAAYTTYTCCCAYTGYTCGGTCTGGTCCAGG 

Seq ID No 3: DNA sequence of Fuz3 

CCCAGCAGCTGGCCTTYAARCARCCWTCHTCHGCNTAYGCNGCNTTYAAYAA 
YCGYCCWCCWTCHACNTGGCTBACNGCCTACGTGGTCAAGG 

Seq ID No 4: DNA sequence of Fuz4 

CCGGCTTCTGCTTCTCCAGRATVAGCCAYTTNACNGCNCCRCAVAGNACGTG 
DGARTCRATNGCRATVAGRTTNGCNGCVAGDGARAANACCTTGACCACGTAGGC 

Seq ID No 5: DNA sequence of Fuz5 

GGAGAAGCAGAAGCCGGAYGGNGTNTTYCARGARGAYGGNCCWGTNATYCA 
CCARGARATGATYGGNGGNTTYCGNAACGCCAAGGAGGCAGATG 

Seq ID No 6: DNA sequence of Fuz6 

GCTCCCAGGAAGGCTRTTNACYTGNCCYTCRCARATRTCRCGNGCYTCYTGV 
AGNGCRATVAGNACRAANGCNGTVAGDGANACATCTGCCTCCTTGGCG 

Seq ID No 7: DNA sequence of Fuz7 

GCCTTCCTGGGAGCATYAAYAARGCNGGNGARTAYATYGARGCNTCHTAYAT 
GAAYCTBCARCGYCCWTAYACNGTNGCNATYGCNGGGTATGCCCTGGCC 

Seq ID No 8: DNA sequence of Fuz8 

CTGGTCAGGCTCCTCCCARCGRTTRCGRTCYTTNGCNGTRTTVAGRAAYTTNC 
CNAGRTAWGGYTCYTCVAGYTTRTTCATVAGGGCCAGGGCATACCC 

Seq ID No 9: DNA sequence of Fuz9 

GAGGAGCCTGACCAGCARCTBTAYAAYGTNGARGCNACNTCHTAYGCNCTBC 
TBGCNCTBCTBCTBCTBAARGAYTTYGAYTCHGTGCCCCCTGTAGTGC 

Seq ID No 10: DNA sequence of Fuz10 

GGGCCAAGGCTTGGAANACCATRAANGTNGCYTGNGTDGANCCRTANCCNC 
CNCCRTARTARCGYTGYTCRTTVAGCCANCGCACTACAGGGGGC 



Seq ID No 11: DNA sequence of Fuz11 

CCAAGCCTTGGCCCARTAYCARACNGAYGTNCCWGAYCACAARGAYCTNAA 
YATGGAYGTNTCCTTCCACCTCCCC 

Seq ID No 12: DNA sequence of Fuz12 

CCCAGAGCCGGCCGGTTATCAGGATCCDGANCCNCCNCCNCCDGANCCNCCN 
CCNCCDGANCCDGADGAGGGGAGGTGGAAGG 

Seq ID No 13: DNA sequence of Fuz13 
GGC TGA TTC TCG AGA AGC AGA AGC 

Seq ID No 14: DNA sequence of Fuz14 
GCT TCT GCT TCT CGA GAA TCA GCC 
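
Several of the short oligonucleotides in this listing form complementary pairs. A quick reverse-complement check, sketched below with a hypothetical helper name, confirms this for Fuz13 and Fuz14.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence; spaces between triplets are ignored."""
    cleaned = seq.replace(" ", "").upper()
    return cleaned.translate(COMPLEMENT)[::-1]


FUZ13 = "GGC TGA TTC TCG AGA AGC AGA AGC"  # Seq ID No 13
FUZ14 = "GCT TCT GCT TCT CGA GAA TCA GCC"  # Seq ID No 14

# True when the two oligonucleotides anneal along their full length
is_pair = reverse_complement(FUZ13) == FUZ14.replace(" ", "")
print(is_pair)
```

The same check can be applied to other pairs in the listing, for example Fuz19 and Fuz20.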

Seq ID No 15: DNA sequence of Fuz15 
GCC TTC CCG GGA GCA TCA ACA AGG C 

Seq ID No 16: DNA sequence of Fuz16 
GCC TTG TTG ATG CTC CCG GGA AGG C 

Seq ID No 17: DNA sequence of Fuz17 
GGA GCC TGA TCA GCA GCT CTA CAA CG 

Seq ID No 18: DNA sequence of Fuz18 
CGT TGT AGA GCT GCT GAT CAG GCT CC 

Seq ID No 19: DNA sequence of Fuz19 
GGGTACACCCAGCAGCTGGCC 



Seq ID No 20: DNA sequence of Fuz20 
GGCCAGCTGCTGGGTGTACCC 






Seq ID No 21: DNA sequence of Fuz21 
GGTGTTCCAAGCTTTGGCCC 

Seq ID No 22: DNA sequence of Fuz22 
GGGCCAAAGCTTGGAACACC 

Seq ID No 23: DNA sequence of Fuz23 
CACCCGAGCCATATGAG 

Seq ID No 24: DNA sequence of Fuz24 
CCCAGAGCCGGCCGGTTATCAGGATCC 

Seq ID No 25: DNA sequence of Fuz25 
CCC AGA GCC GGC CGG TTA GCA GGA TCC 



SEQ ID No 26: C3d1 amino acid sequence expressed in E. coli 

1    Met Ala Ser Gly Ser Thr Pro Ala Gly Ser 
11   Gly Glu Gln Asn Met Ile Gly Met Thr Pro 
21   Thr Val Ile Ala Val His Tyr Leu Asp Gln 
31   Thr Glu Gln Trp Glu Lys Phe Gly Ile Glu 
41   Lys Arg Gln Glu Ala Leu Glu Leu Ile Lys 
51   Lys Gly Tyr Thr Gln Gln Leu Ala Phe Lys 
61   Gln Pro Ser Ser Ala Tyr Ala Ala Phe Asn 
71   Asn Arg Pro Pro Ser Thr Trp Leu Thr Ala 
81   Tyr Val Val Lys Val Phe Ser Leu Ala Ala 
91   Gln Leu Ile Ala Ile Asp Ser His Val Leu 
101  Cys Gly Ala Val Lys Trp Leu Ile Leu Glu 
111  Lys Gln Lys Pro Asp Gly Val Phe Gln Glu 
121  Asp Gly Pro Val Ile His Gln Glu Met Ile 
131  Gly Gly Phe Arg Asn Ala Lys Glu Ala Asp 
141  Val Ser Leu Thr Ala Phe Val Leu Ile Ala 
151  Leu Gln Glu Ala Arg Asp Ile Cys Glu Gly 
161  Gln Val Asn Ser Leu Pro Gly Ser Ile Asn 
171  Lys Ala Gly Glu Tyr Ile Glu Ala Ser Tyr 
181  Met Asn Leu Gln Arg Pro Tyr Thr Val Ala 
191  Ile Ala Gly Tyr Ala Leu Ala Leu Met Asn 
201  Lys Leu Glu Glu Pro Tyr Leu Gly Lys Phe 
211  Leu Asn Thr Ala Lys Asp Arg Asn Arg Trp 
221  Glu Glu Pro Asp Gln Gln Leu Tyr Asn Val 
231  Glu Ala Thr Ser Tyr Ala Leu Leu Ala Leu 
241  Leu Leu Leu Lys Asp Phe Asp Ser Val Pro 
251  Pro Val Val Arg Trp Leu Asn Glu Gln Arg 
261  Tyr Tyr Gly Gly Gly Tyr Gly Ser Thr Gln 
271  Ala Thr Phe Met Val Phe Gln Ala Leu Ala 
281  Gln Tyr Gln Thr Asp Val Pro Asp His Asp 
291  Leu Asn Met Asp Val Ser Phe His Leu Pro 
301  Ser Ser Gly Ser Glu Glu Phe 



SEQ ID No 27: (C3d)3 amino acid sequence expressed in E. coli 

1    Met Ala Ser Gly Ser Thr Pro Ala Gly Ser 
11   Gly Glu Gln Asn Met Ile Gly Met Thr Pro 
21   Thr Val Ile Ala Val His Tyr Leu Asp Gln 
31   Thr Glu Gln Trp Glu Lys Phe Gly Ile Glu 
41   Lys Arg Gln Glu Ala Leu Glu Leu Ile Lys 
51   Lys Gly Tyr Thr Gln Gln Leu Ala Phe Lys 
61   Gln Pro Ser Ser Ala Tyr Ala Ala Phe Asn 
71   Asn Arg Pro Pro Ser Thr Trp Leu Thr Ala 
81   Tyr Val Val Lys Val Phe Ser Leu Ala Ala 
91   Gln Leu Ile Ala Ile Asp Ser His Val Leu 
101  Cys Gly Ala Val Lys Trp Leu Ile Leu Glu 
111  Lys Gln Lys Pro Asp Gly Val Phe Gln Glu 
121  Asp Gly Pro Val Ile His Gln Glu Met Ile 
131  Gly Gly Phe Arg Asn Ala Lys Glu Ala Asp 
141  Val Ser Leu Thr Ala Phe Val Leu Ile Ala 
151  Leu Gln Glu Ala Arg Asp Ile Cys Glu Gly 
161  Gln Val Asn Ser Leu Pro Gly Ser Ile Asn 
171  Lys Ala Gly Glu Tyr Ile Glu Ala Ser Tyr 
181  Met Asn Leu Gln Arg Pro Tyr Thr Val Ala 
191  Ile Ala Gly Tyr Ala Leu Ala Leu Met Asn 
201  Lys Leu Glu Glu Pro Tyr Leu Gly Lys Phe 
211  Leu Asn Thr Ala Lys Asp Arg Asn Arg Trp 
221  Glu Glu Pro Asp Gln Gln Leu Tyr Asn Val 
231  Glu Ala Thr Ser Tyr Ala Leu Leu Ala Leu 
241  Leu Leu Leu Lys Asp Phe Asp Ser Val Pro 
251  Pro Val Val Arg Trp Leu Asn Glu Gln Arg 
261  Tyr Tyr Gly Gly Gly Tyr Gly Ser Thr Gln 
271  Ala Thr Phe Met Val Phe Gln Ala Leu Ala 
281  Gln Tyr Gln Thr Asp Val Pro Asp His Asp 
291  Leu Asn Met Asp Val Ser Phe His Leu Pro 
301  Ser Ser Gly Ser Gly Gly Gly Gly Ser Gly 
311  Gly Gly Gly Ser Gly Ser Thr Pro Ala Gly 
321  Ser Gly Glu Gln Asn Met Ile Gly Met Thr 
331  Pro Thr Val Ile Ala Val His Tyr Leu Asp 
341  Gln Thr Glu Gln Trp Glu Lys Phe Gly Ile 
351  Glu Lys Arg Gln Glu Ala Leu Glu Leu Ile 
361  Lys Lys Gly Tyr Thr Gln Gln Leu Ala Phe 
371  Lys Gln Pro Ser Ser Ala Tyr Ala Ala Phe 
381  Asn Asn Arg Pro Pro Ser Thr Trp Leu Thr 
391  Ala Tyr Val Val Lys Val Phe Ser Leu Ala 
401  Ala Gln Leu Ile Ala Ile Asp Ser His Val 
411  Leu Cys Gly Ala Val Lys Trp Leu Ile Leu 
421  Glu Lys Gln Lys Pro Asp Gly Val Phe Gln 
431  Glu Asp Gly Pro Val Ile His Gln Glu Met 
441  Ile Gly Gly Phe Arg Asn Ala Lys Glu Ala 
451  Asp Val Ser Leu Thr Ala Phe Val Leu Ile 
461  Ala Leu Gln Glu Ala Arg Asp Ile Cys Glu 
471  Gly Gln Val Asn Ser Leu Pro Gly Ser Ile 
481  Asn Lys Ala Gly Glu Tyr Ile Glu Ala Ser 
491  Tyr Met Asn Leu Gln Arg Pro Tyr Thr Val 
501  Ala Ile Ala Gly Tyr Ala Leu Ala Leu Met 
511  Asn Lys Leu Glu Glu Pro Tyr Leu Gly Lys 
521  Phe Leu Asn Thr Ala Lys Asp Arg Asn Arg 
531  Trp Glu Glu Pro Asp Gln Gln Leu Tyr Asn 
541  Val Glu Ala Thr Ser Tyr Ala Leu Leu Ala 
551  Leu Leu Leu Leu Lys Asp Phe Asp Ser Val 
561  Pro Pro Val Val Arg Trp Leu Asn Glu Gln 
571  Arg Tyr Tyr Gly Gly Gly Tyr Gly Ser Thr 
581  Gln Ala Thr Phe Met Val Phe Gln Ala Leu 
591  Ala Gln Tyr Gln Thr Asp Val Pro Asp His 
601  Asp Leu Asn Met Asp Val Ser Phe His Leu 
611  Pro Ser Ser Gly Ser Gly Gly Gly Gly Ser 
621  Gly Gly Gly Gly Ser Gly Ser Thr Pro Ala 
631  Gly Ser Gly Glu Gln Asn Met Ile Gly Met 
641  Thr Pro Thr Val Ile Ala Val His Tyr Leu 
651  Asp Gln Thr Glu Gln Trp Glu Lys Phe Gly 
661  Ile Glu Lys Arg Gln Glu Ala Leu Glu Leu 
671  Ile Lys Lys Gly Tyr Thr Gln Gln Leu Ala 
681  Phe Lys Gln Pro Ser Ser Ala Tyr Ala Ala 
691  Phe Asn Asn Arg Pro Pro Ser Thr Trp Leu 
701  Thr Ala Tyr Val Val Lys Val Phe Ser Leu 
711  Ala Ala Gln Leu Ile Ala Ile Asp Ser His 
721  Val Leu Cys Gly Ala Val Lys Trp Leu Ile 
731  Leu Glu Lys Gln Lys Pro Asp Gly Val Phe 
741  Gln Glu Asp Gly Pro Val Ile His Gln Glu 
751  Met Ile Gly Gly Phe Arg Asn Ala Lys Glu 
761  Ala Asp Val Ser Leu Thr Ala Phe Val Leu 
771  Ile Ala Leu Gln Glu Ala Arg Asp Ile Cys 
781  Glu Gly Gln Val Asn Ser Leu Pro Gly Ser 
791  Ile Asn Lys Ala Gly Glu Tyr Ile Glu Ala 
801  Ser Tyr Met Asn Leu Gln Arg Pro Tyr Thr 
811  Val Ala Ile Ala Gly Tyr Ala Leu Ala Leu 
821  Met Asn Lys Leu Glu Glu Pro Tyr Leu Gly 
831  Lys Phe Leu Asn Thr Ala Lys Asp Arg Asn 
841  Arg Trp Glu Glu Pro Asp Gln Gln Leu Tyr 
851  Asn Val Glu Ala Thr Ser Tyr Ala Leu Leu 
861  Ala Leu Leu Leu Leu Lys Asp Phe Asp Ser 
871  Val Pro Pro Val Val Arg Trp Leu Asn Glu 
881  Gln Arg Tyr Tyr Gly Gly Gly Tyr Gly Ser 
891  Thr Gln Ala Thr Phe Met Val Phe Gln Ala 
901  Leu Ala Gln Tyr Gln Thr Asp Val Pro Asp 
911  His Asp Leu Asn Met Asp Val Ser Phe His 
921  Leu Pro Ser Ser Gly Ser Glu Glu Phe 



SEQ. ID No 28 
TATGGCTAGCG 



SEQ. ID No 29 
ACCGATCGCCTAG 

Seq ID No 30: DNA sequence of PCR forward primer #39171 
GAATTCCTAGCTTGCTTG 

Seq ID No 31 : DNA sequence of PCR reverse primer #39172 
TCTAGAGTCGACCAGAC 



Seq ID No 32: DNA sequence of pBP66-01 

1 AACGGCTCCG CCCACTATTA ATGAAATTAA AAATTCCAAT TTTAAAAAAC 50 
51 GCAGCAAGAG AAACATTTGT ATGAAAGAAT GCGTAGAAGG AAAGAAAAAT 100 
101 GTCGTCGACA TGCTGAACAA CAAGATTAAT ATGCCTCCGT GTATAAAAAA 150 
151 AATATTGAAC GATTTGAAAG AAAACAATGT ACCGCGCGGC GGTATGTACA 200 
201 GGAAGAGGTT TATACTAAAC TGTTACATTG CAAACGTGGT TTCGTGTGCC 250 
251 AAGTGTGAAA ACCGATGTTT AATCAAGGCT CTGACGCATT TCTACAACCA 300 
301 CGACTCCAAG TGTGTGGGTG AAGTCATGCA TCTTTTAATC AAATCCCAAG 350 
351 ATGTGTATAA ACCACCAAAC TGCCAAAAAA TGAAAACTGT CGACAAGCTC 400 
401 TGTCCGTTTG CTGGCAACTG CAAGGGTCTC AATCCTATTT GTAATTATTG 450 
451 AATAATAAAA CAATTATAAA TGCTAAATTT GTTTTTTATT AACGATACAA 500 
501 ACCAAACGCA ACAAGAACAT TTGTAGTATT ATCTATAATT GAAAACGCGT 550 
551 AGTTATAATC GCTGAGGTAA TATTTAAAAT CATTTTCAAA TGATTCACAG 600 
601 TTAATTTGCG ACAATATAAT TTTATTTTCA CATAAACTAG ACGCCTTGTC 650 
651 GTCTTCTTCT TCGTATTCCT TCTCTTTTTC ATTTTTCTCC TCATAAAAAT 700 
701 TAACATAGTT ATTATCGTAT CCATATATGT ATCTATCGTA TAGAGTAAAT 750 
751 TTTTTGTTGT CATAAATATA TATGTCTTTT TTAATGGGGT GTATAGTACC 800 
801 GCTGCGCATA GTTTTTCTGT AATTTACAAC AGTGCTATTT TCTGGTAGTT 850 
851 CTTCGGAGTG TGTTGCTTTA ATTATTAAAT TTATATAATC AATGAATTTG 900 
901 GGATCGTCGG TTTTGTACAA TATGTTGCCG GCATAGTACG CAGCTTCTTC 950 
951 TAGTTCAATT ACACCATTTT TTAGCAGCAC CGGATTAACA TAACTTTCCA 1000 
1001 AAATGTTGTA CGAACCGTTA AACAAAAACA GTTCACCTCC CTTTTCTATA 1050 
1051 CTATTGTCTG CGAGCAGTTG TTTGTTGTTA AAAATAACAG CCATTGTAAT 1100 
1101 GAGACGCACA AACTAATATC ACAAACTGGA AATGTCTATC AATATATAGT 1150 
1151 TGCTGATATC ATGGAGATAA TTAAAATGAT AACCATCTCG CAAATAAATA 1200 
1201 AGTATTTTAC TGTTTTCGTA ACAGTTTTGT AATAAAAAAA CCTATAAATA 1250 
1251 CGGATCCCGG GAATTCCTAG CTTGCTTGTT CTTTTTGCAG AAGCTCAGAA 1300 
1301 TAAACGCTCA ACTTTGGCCA TGGCCCTCTG GATGCGCCTC CTGCCCCTGC 1350 
1351 TGGCCCTGCT GGCCCTCTGG GCGCCCGCGC CCACCCGAGC CGGATCCAGA 1400 
1401 TCTACCCCCG CAGGCTCTGG GGAACAGAAC ATGATTGGCA TGACACCAAC 1450 
1451 AGTCATTGCG GTACACTACC TGGACCAGAC CGAACAGTGG GAGAAGTTCG 1500 
1501 GCATAGAGAA GAGGCAAGAG GCCCTGGAGC TCATCAAGAA AGGGTACACC 1550 
1551 CAGCAGCTGG CCTTCAAACA GCCCAGCTCT GCCTATGCTG CCTTCAACAA 1600 
1601 CCGGCCCCCC AGCACCTGGC TGACAGCCTA CGTGGTCAAG GTCTTCTCTC 1650 
1651 TAGCTGCCAA CCTCATCGCC ATCGACTCTC ACGTCCTGTG TGGGGCTGTT 1700 
1701 AAATGGTTGA TTCTGGAGAA ACAGAAGCCG GATGGTGTCT TTCAGGAGGA 1750 
1751 TGGGCCCGTG ATTCACCAAG AAATGATTGG TGGCTTCCGG AACGCCAAGG 1800 
1801 AGGCAGATGT GTCACTCACA GCCTTCGTCC TCATCGCACT GCAGGAAGCC 1850 
1851 AGGGACATCT GTGAGGGGCA GGTCAATAGC CTTCCTGGGA GCATCAACAA 1900 
1901 GGCAGGGGAG TATATTGAAG CCAGTTACAT GAACCTGCAG AGACCATACA 1950 
1951 CAGTGGCCAT TGCTGGGTAT GCCCTGGCCC TGATGAACAA ACTGGAGGAA 2000 
2001 CCTTACCTCG GCAAGTTTCT GAACACAGCC AAAGATCGGA ACCGCTGGGA 2050 
2051 GGAGCCTGAC CAGCAGCTCT ACAACGTAGA GGCCACATCC TACGCCCTCC 2100 
2101 TGGCCCTGCT GCTGCTGAAA GACTTTGACT CTGTGCCCCC TGTAGTGCGC 2150 
2151 TGGCTCAATG AGCAAAGATA CTACGGAGGC GGCTATGGCT CCACCCAGGC 2200 
2201 TACCTTCATG GTATTCCAAG CCTTGGCCCA ATATCAAACA GATGTCCCTG 2250 
2251 ACCATAAGGA CTTGAACATG GATGTGTCCT TCCACCTCCC CAGCAGTGGA 2300 
2301 TCTGGTGGAG GTGGAAGTGG TGGAGGTGGA GAAGAGTTCT GAGGATCTTA 2350 
2351 TTAAAGCAGA ACTTGTTTAT TGCAGCTTAT AATGGTTACA AATAAAGCAA 2400 
2401 TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT 2450 
2451 GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG GTCGACTCTA 2500 
2501 GATTCGAACT CGAGGCCTGC AGGGCGGCCG CTTAATTAAT TGATCCGGGT 2550 
2551 TATTAGTACA TTTATTAAGC GCTAGATTCT GTGCGTTGTT GATTTACAGA 2600 
2601 CAATTGTTGT ACGTATTTTA ATAATTCATT AAATTTATAA TCTTTAGGGT 2650 
2651 GGTATGTTAG AGCGAAAATC AAATGATTTT CAGCGTCTTT ATATCTGAAT 2700 
2701 TTAAATATTA AATCCTCAAT AGATTTGTAA AATAGGTTTC GATTAGTTTC 2750 
2751 AAACAAGGGT TGTTTTTCCG AACCGATGGC TGGACTATCT AATGGATTTT 2800 
2801 CGCTCAACGC CACAAAACTT GCCAAATCTT GTAGCAGCAA TCTAGCTTTG 2850 
2851 TCGATATTCG TTTGTGTTTT GTTTTGTAAT AAAGGTTCGA CGTCGTTCAA 2900 
2901 AATATTATGC GCTTTTGTAT TTCTTTCATC ACTGTCGTTA GTGTACAATT 2950 
2951 GACTCGACGT AAACACGTTA AATAAAGCTT GGACATATTT AACATCGGGC 3000 
3001 GTGTTAGCTT TATTAGGCCG ATTATCGTCG TCGTCCCAAC CCTCGTCGTT 3050 
3051 AGAAGTTGCT TCCGAAGACG ATTTTGCCAT AGCCACACGA CGCCTATTAA 3100 
3101 TTGTGTCGGC TAACACGTCC GCGATCAAAT TTGTAGTTGA GCTTTTTGGA 3150 
3151 ATTATTTCTG ATTGCGGGCG TTTTTGGGCG GGTTTCAATC TAACTGTGCC 3200 
3201 CGATTTTAAT TCAGACAACA CGTTAGAAAG CGATGGTGCA GGCGGTGGTA 3250 
3251 ACATTTCAGA CGGCAAATCT ACTAATGGCG GCGGTGGTGG AGCTGATGAT 3300 
3301 AAATCTACCA TCGGTGGAGG CGCAGGCGGG GCTGGCGGCG GAGGCGGAGG 3350 
3351 CGGAGGTGGT GGCGGTGATG CAGACGGCGG TTTAGGCTCA AATGTCTCTT 3400 
3401 TAGGCAACAC AGTCGGCACC TCAACTATTG TACTGGTTTC GGGCGCCGTT 3450 
3451 TTTGGTTTGA CCGGTCTGAG ACGAGTGCGA TTTTTTTCGT TTCTAATAGC 3500 
3501 TTCCAACAAT TGTTGTCTGT CGTCTAAAGG TGCAGCGGGT TGAGGTTCCG 3550 
3551 TCGGCATTGG TGGAGCGGGC GGCAATTCAG ACATCGATGG TGGTGGTGGT 3600 
3601 GGTGGAGGCG CTGGAATGTT AGGCACGGGA GAAGGTGGTG GCGGCGGTGC 3650 
3651 CGCCGGTATA ATTTGTTCTG GTTTAGTTTG TTCGCGCACG ATTGTGGGCA 3700 
3701 CCGGCGCAGG CGCCGCTGGC TGCACAACGG AAGGTCGTCT GCTTCGAGGC 3750 
3751 AGCGCTTGGG GTGGTGGCAA TTCAATATTA TAATTGGAAT ACAAATCGTA 3800 
3801 AAAATCTGCT ATAAGCATTG TAATTTCGCT ATCGTTTACC GTGCCGATAT 3850 
3851 TTAACAACCG CTCAATGTAA GCAATTGTAT TGTAAAGAGA TTGTCTCAAG 3900 
3901 CTCGGATCGA TCCCGCACGC CGATAACAAG CCTTTTCATT TTTACTACAG 3950 
3951 CATTGTAGTG GCGAGACACT TCGCTGTCGT CGCCTGATGC GGTATTTTCT 4000 
4001 CCTTACGCAT CTGTGCGGTA TTTCACACCG CATACGTCAA AGCAACCATA 4050 
4051 GTACGCGCCC TGTAGCGGCG CATTAAGCGC GGCGGGTGTG GTGGTTACGC 4100 
4101 GCAGCGTGAC CGCTACACTT GCCAGCGCCC TAGCGCCCGC TCCTTTCGCT 4150 
4151 TTCTTCCCTT CCTTTCTCGC CACGTTCGCC GGCTTTCCCC GTCAAGCTCT 4200 
4201 AAATCGGGGG CTCCCTTTAG GGTTCCGATT TAGTGCTTTA CGGCACCTCG 4250 
4251 ACCCCAAAAA ACTTGATTTG GGTGATGGTT CACGTAGTGG GCCATCGCCC 4300 
4301 TGATAGACGG TTTTTCGCCC TTTGACGTTG GAGTCCACGT TCTTTAATAG 4350 
4351 TGGACTCTTG TTCCAAACTG GAACAACACT CAACCCTATC TCGGGCTATT 4400 
4401 CTTTTGATTT ATAAGGGATT TTGCCGATTT CGGCCTATTG GTTAAAAAAT 4450 
4451 GAGCTGATTT AACAAAAATT TAACGCGAAT TTTAACAAAA TATTAACGTT 4500 
4501 TACAATTTTA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT 4550 
4551 TAAGCCAGCC CCGACAGCCG CCAACACCCG CTGACGCGCC CTGACGGGCT 4600 
4601 TGTCTGCTCC CGGCATCCGC TTACAGACAA GCTGTGACCG TCTCCGGGAG 4650 
4651 CTGCATGTGT CAGAGGTTTT CACCGTCATC ACCGAAACGC GCGAGACGAA 4700 
4701 AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT GATAATAATG 4750 
4751 GTTTCTTAGA CGTCAGGTGG CACTTTTCGG GGAAATGTGC GCGGAACCCC 4800 
4801 TATTTGTTTA TTTTTCTAAA TACATTCAAA TATGTATCCG CTCATGAGAC 4850 
4851 AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA GAGTATGAGT 4900 
4901 ATTCAACATT TCCGTGTCGC CATTATTCCC TTTTTTGCGG CATTTTGCCT 4950 
4951 TCCTGTTTTT GCTCACCCAG AAACGCTGGT GAAAGTAAAA GATGCTGAAG 5000 
5001 ATCAGTTGGG TGCACGAGTG GGTTACATCG AACTGGATCT CAACAGCGGT 5050 
5051 AAGATCCTTG AGAGTTTTCG CCCCGAAGAA CGTTTTCCAA TGATGAGCAC 5100 
5101 TTTTAAAGTT CTGCTATGTG GCGCGGTATT ATCCCGTATT GACGCCGGGC 5150 
5151 AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA CTTGGTTGAG 5200 
5201 TACTCACCAG TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA 5250 
5251 ATTATGCAGT GCTGCCATAA CCATGAGTGA TAACACTGCG GCCAACTTAC 5300 
5301 TTCTGACAAC GATCGGAGGA CCGAAGGAGC TAACCGCTTT TTTGCACAAC 5350 
5351 ATGGGGGATC ATGTAACTCG CCTTGATCGT TGGGAACCGG AGCTGAATGA 5400 
5401 AGCCATACCA AACGACGAGC GTGACACCAC GTGCCTGTAG CAATGGCAAC 5450 
5451 AACGTTGCGC AAACTATTAA CTGGCGAACT ACTTACTCTA GCTTCCCGGC 5500 
5501 AACAATTAAT AGACTGGATG GAGGCGGATA AAGTTGCAGG ACCACTTCTG 5600 
5601 CGCTCGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAAAT CTGGAGCCGG 5650 
5651 TGAGCGTGGG TCTCGCGGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC 5700 
5701 CCTCCCGTAT CGTAGTTATC TACACGACGG GGAGTCAGGC AACTATGGAT 5750 
5751 GAACGAAATA GACAGATCGC TGAGATAGGT GCCTCACTGA TTAAGCATTG 5800 
5801 GTAACTGTCA GACCAAGTTT ACTCATATAT ACTTTAGATT GATTTAAAAC 5850 
5851 TTCATTTTTA ATTTAAAAGG ATCTAGGTGA AGATCCTTTT TGATAATCTC 5900 
5901 ATGACCAAAA TCCCTTAACG TGAGTTTTCG TTCCACTGAG CGTCAGACCC 5950 
5951 CGTAGAAAAG ATCAAAGGAT CTTCTTGAGA TCCTTTTTTT CTGCGCGTAA 6000 
6001 TCTGCTGCTT GCAAACAAAA AAACCACCGC TACCAGCGGT GGTTTGTTTG 6050 
6051 CCGGATCAAG AGCTACCAAC TCTTTTTCCG AAGGTAACTG GCTTCAGCAG 6100 
6101 AGCGCAGATA CCAAATACTG TCCTTCTAGT GTAGCCGTAG TTAGGCCACC 6150 
6151 ACTTCAAGAA CTCTGTAGCA CCGCCTACAT ACCTCGCTCT GCTAATCCTG 6200 
6201 TTACCAGTGG CTGCTGCCAG TGGCGATAAG TCGTGTCTTA CCGGGTTGGA 6250 
6251 CTCAAGACGA TAGTTACCGG ATAAGGCGCA GCGGTCGGGC TGAACGGGGG 6300 
6301 GTTCGTGCAC ACAGCCCAGC TTGGAGCGAA CGACCTACAC CGAACTGAGA 6350 
6351 TA[?]TTACAGC GTGAGCTATG AGAAAGCGCC ACGCTTCCCG AAGGGAGAAA 6400 
6401 GGCGGACAGG TATCCGGTAA GCGGCAGGGT [illegible] [illegible] 6450 
6451 GGGAGCTTCC AGGGGGAAAC GCCTGGTATC TTTATAGTCC TGTCGGGTTT 6500 
6501 CGCCACCTCT GACTTGAGCG TCGATTTTTG TGATGCTCGT CAGGGGGGCG 6550 
6551 GAGCCTATGG AAAAACGCCA GCAACGCGGC CTTTTTACGG TTCCTGGCCT 6600 
6601 TTTGCTGGCC TTTTGCTCAC ATGTTCTTTC CTGCGTTATC CCCTGATTCT 6650 
6651 GTGGATAACC GTATTACCGC CTTTGAGTGA GCTGATACCG CTCGCCGCAG 6700 
6701 CCGAACGACC GAGCGCAGCG AGTCAGTGAG CGAGGAAGCG GAAGAGCGCC 6750 
6751 CAATACGCAA ACCGCCTCTC CCCGCGCGTT GGCCGATTCA TTAATGCAG 6799 





Seq ID No 33 
CAGTGGATCTTGCTAGAGTTCTGCAGGATC 

Seq ID No 34 
GATCCTTGCAGAACTCTAGCAAGATCCACTG 

Seq ID No 35 DNA sequence of #50391 
CCAGCAGTGGCTCTTCCTGCTTCTGCAGGATC 

Seq ID No 36 DNA sequence of #50392 

GATCCTGCAGAAGCAGGAAGAGCCACTGCTGG 

Seq ID No 37: Amino acid sequence of C3d-YL expressed in baculovirus/Sf9 

1    Met Ala Leu Trp Met Arg Leu Leu Pro Leu 
11   Leu Ala Leu Leu Ala Leu Trp Ala Pro Ala 
21   Pro Thr Arg Ala Gly Ser Arg Ser Thr Pro 
31   Ala Gly Ser Gly Glu Gln Asn Met Ile Gly 
41   Met Thr Pro Thr Val Ile Ala Val His Tyr 
51   Leu Asp Gln Thr Glu Gln Trp Glu Lys Phe 
61   Gly Ile Glu Lys Arg Gln Glu Ala Leu Glu 
71   Leu Ile Lys Lys Gly Tyr Thr Gln Gln Leu 
81   Ala Phe Lys Gln Pro Ser Ser Ala Tyr Ala 
91   Ala Phe Asn Asn Arg Pro Pro Ser Thr Trp 
101  Leu Thr Ala Tyr Val Val Lys Val Phe Ser 
111  Leu Ala Ala Gln Leu Ile Ala Ile Asp Ser 
121  His Val Leu Cys Gly Ala Val Lys Trp Leu 
131  Ile Leu Glu Lys Gln Lys Pro Asp Gly Val 
141  Phe Gln Glu Asp Gly Pro Val Ile His Gln 
151  Glu Met Ile Gly Gly Phe Arg Asn Ala Lys 
161  Glu Ala Asp Val Ser Leu Thr Ala Phe Val 
171  Leu Ile Ala Leu Gln Glu Ala Arg Asp Ile 
181  Cys Glu Gly Gln Val Asn Ser Leu Pro Gly 
191  Ser Ile Asn Lys Ala Gly Glu Tyr Ile Glu 
201  Ala Ser Tyr Met Asn Leu Gln Arg Pro Tyr 
211  Thr Val Ala Ile Ala Gly Tyr Ala Leu Ala 
221  Leu Met Asn Lys Leu Glu Glu Pro Tyr Leu 
231  Gly Lys Phe Leu Asn Thr Ala Lys Asp Arg 
241  Asn Arg Trp Glu Glu Pro Asp Gln Gln Leu 
251  Tyr Asn Val Glu Ala Thr Ser Tyr Ala Leu 
261  Leu Ala Leu Leu Leu Leu Lys Asp Phe Asp 
271  Ser Val Pro Pro Val Val Arg Trp Leu Asn 
281  Glu Gln Arg Tyr Tyr Gly Gly Gly Tyr Gly 
291  Ser Thr Gln Ala Thr Phe Met Val Phe Gln 
301  Ala Leu Ala Gln Tyr Gln Thr Asp Val Pro 
311  Asp His Asp Leu Asn Met Asp Val Ser Phe 
321  His Leu Pro Ser Ser Gly Ser Glu Glu Phe 



Seq ID No 38: Oligonucleotide for modification of bacterial vector pBroc413 
TAT GAG ATC TCC CGG GGG ATC CTA GCG GCC GCT GCA 
Seq ID No 39: Oligonucleotide for modification of bacterial vector pBroc413 
GCG GCC GCT AGG ATC CCC CGG GAG ATC TCA 



